AGCATCTGCT
TAAATACTTA 
AAAGGAAGCT 
CGACAATTCT 
CTGTTGCAGT 
AAAACTAGTC
TTTACTGAAC 
CATCTGTTGC 
AGGACGGTAG 
AAGCCATTCA 
GACTTTTTGA 
AGCTCCTAGA 
ACCAGCGATC 
CAATACCAGC 
GGACACGAAT 
CCAAGACACC 
ACTCATCAAT 
TTAAACCATA 
AAAGTGACAA 
GCTTTTTTCC 
AAGCCTAGTC 
CCCATAATCA 
ACTTTCTTTC 
GACAGACTCA 
TTAAAGTTGA 
TAACCTGCAA 
AGATTAAAGA 
CCTAAAACAT 
TCATCCAGCA 



TTGACAACAT 
GCTACATGCT 
TGGCGTTCAT 
TCTTTACTTG 
AAGCGAGTCG 
AACTGGTCTC 
CACTGAGGTG 
ATTCCAAAAT 
GAACGATTGT 
CTAGAGCCAA 
CTTCGCTCCT 
TTATGATGCA 
AAGACCGGTC 
TATGATTTTA 
GTCAAAACGG 
TGCATCAAAA 
AATTTCAATA 
GATTATTCAA 
CCATGCCAAG 
GTTCCTTATT 
CATGAAAGAG 
GCATATTTAC 
TGGTTAACAC 
TATGCCAGCG 
TAGGGCTACG 
AGTCAAAGAA 
AGCCACCTGA 
GAGCTAGGAT 
TATCCATCAA 



CTTCTATTGC 
TATCGTAGTT 
TAAAACGAGC 
CCAACTTCTG 
CTGCATATTG 
CATTAAAATA 
AGATAACATA 
ATTGGTTAAG 
ATTTCTCAGC 
AGAAGGGAAC 
TAAAACTATC 
TCTCAGTAGG 
CGAAGATCAT 
TTAGCTGTAT 
TTCTCAATCT 
AGATCTTCAT 
ACTTCTGATT 
GAATCCAGAA 
CAACTGAATC 
GAGCGTTTTT 
TCCATAGGCG 
AATGTAGGCC 
CATCACCATT 
ATTCCAAAAC 
GATTCCCATC 
GAGTTCCAGA 
CTGCAAGGCT 
AAACTTATAC 
CTCATCTCGC 




CTGATAAGAA AACTGGTCTG 
AACATAGCCT CTAACCGAAA 
CAATAATTCA ATCATTTCAT 
AACCAGGTCC TTCATAGCTA 
ACTAGCCTGA TCCCCAGATT 
CTGCTGGAAG GCTGCTGGAT 
CACAACTTGT TTATTCTCCA 
CGATGCAGCT CCCCCCTGTC 
TAATACCGCA GGATGAGCAC 
AAAACGCACA TTTGGATCAG 
GATAGTAGTA GCCACTGCTG 
ATAAAAGAAA ATGAGCAGAA 
CCATAAGCGT TTAAGCATTT 
TCCAGTCGTC ACGACCAAAC 
CCACAATCAA CTCAACCGTT 
CCATCATGTC AGAAACATCT 
TGATATCCAT ATTTTATTTC 
AAGATTAAGA ATGACAACAT 
CAGCGATTCT CAGGTAGGGC 
TTCTTGCGAA CCCAGGCATC 
ATATAGTACC AGGTCACACC 
ATGCTTGAGG TTACATTACG 
CGCATAAAGA CAAAGTCACG 
TCCTTTAAAT CCCTTGATAA 
AAGTTTGAGA TGGCCAAAGC 
CCAAAAGTAT ACATAACTGC 
AAATTCTTCA GAGGAGGTAG 
AAAAAGCCCC ACATGATATA 
TCAGGAATAG CCTGATAATT 



GCAAGATTTT 
ACTGACCAAA 
TGTCTGCTGT 
CGTTTGGGAA 
GATGTTTCAG 
CATAGCCATT 
GCTGTGGTAA 
CTAAAAGATA 
CGTCAAAACG 
ATAGTGCTCT 
AACGCTTTTC 
AAACCAACAA 
TGTAGCTCCA 
TCTGTTACAG 
CCCATACTAT 
TCCATAAACA 
CTTTTATTTT 
GACAACATGG 
AGCCTTCCCT 
ATTGATGACC 
ATGCCAAAAT 
ATTTTTAAAG 
GAACCAGAAG 
AAAGGGCTTG 
AAACATAGAA 
CAAGGCATAG 
TAAGGTCTCT 
GCGGACAGAT 
TTCATTAAAT 
CGCTTAAAGC
AATTCCCAGA 
GAACGAAAGG 
GCTGGTTGCA 
ACCCACTTGC 
CAGCAAAGGT 
ACAATAAAGA 
CCTATAAAGA 
GGCTCTAAAT 
ATCCTTTGAT 
TAGATGGCAT 
TATCGATATC 
GATTTTGTAC 
GAGACTTGTT 
TCTGGAAGTC 
CGCCTGTGTG 
GATTGTTCAT 
CATTTGGCAG 
AGCCGATTGG 
GAGCTACTGT 
CGCGCAGTTT 
TTCCAGGCAT 
GTGTTGATGT 
CCTGAGTGAT 
AATACATGAC 
GTGTCGCAAA 
AAATCTGCAC 
CATCTCCCTT 
CCTGAGCTAG 
TAATCAAGCT 



GATCGATTGG 
GGGTAAAATC 
TCAGGTAAGA 
CCTTGACAAA 
CATCCTTGCT 
AAATACCCAA 
AGAGACTTAC 
TGGGCAAGGT 
GAGGAAGCTG 
GTCAATCTTT 
CATATAGGAC 
TCGCTCAAAC 
CTTGTGGTCC 
GAGGTTTTGA 
CATGCGTCCG 
ATAGGCTGGC 
ATAACCTTTT 
TTTATTTCCT 
TAGGCGTTTG 
CGCTTCTGTT 
TTGAGCTGTT 
TTTCTCACTG 
CCAGATAGCG 
GACTGAAGGA 
AGACAAGTCA 
TTCCTTATCC 
TCCCTTAGGC 
GACTGGATGC 
ATTTATCATT 
TGGCTCTGCT 



ACCACTCGAG 
CTTAATCACT 
AATTCCCAAG 
GATAATCGGA 
TTTTCGATAA 
GGCAGCTAGT 
CAACACTTCA 
TGCAGCAATC 
TTGAAAAAAC 
CCATTTGGAG 
ATCATGATGT 
TGCTCACGAA 
TTGTTATAGC 
GAGACATCTT 
CCGTAGAGAA 
AGATCTTCAA 
GAAACAGCTG 
TCCTCGTCAA 
AGAGTCGCTA 
GGGCCGTAAG 
TTGACCGTCA 
TTGAAGTATT 
ATTGGCAATG 
AGAGTGAAAA 
AAAGAATAAG 
GTAATCATCC 
TTACCAGTCG 
GTGATTTCAT 
GGTGTAGAAA 
ACTTCTAAAA 



AAAGTTGGCA 



CCATCTCTCA 
AACCCAAGCA 
AGTAGGGACA 
TGCTTGTAGA 
TGATTGGTCT 
TACCAGGCAA 
ACATAAACAA 
TCCATCATCT 
TTAGTGGCAA 
CTGTCAGGTC 
CACCGTCTTT 
GCGGTACTGC 
CTAACTCAAT 
GCAAGCCCTC 
ACTCAAAGAA 
GCCCAGAAAC 
TGATAAAGGT 
ACATCTCGTC 
CATTGATGAT 
ATTCTTCACC 
CAGACAACAT 
AAAAGATAGC 
GCGTACCACC 
GTGGCTGTGC 
AGTTTGTAAA 
TACCAGAAGT 
AGTTATTCCC 
CCTGCTCCAA 
TAGCTGAAAC 



TGAAGAGAAG 
GCTCGATGAC 
AAGACTGCGT 
GAAAACTAAC 
AAAGCAGGAG 
TTCCACCCAC 
AGCGTTTCTT 
AATACTGAGG 
CTTATTCACC 
ACTGTCTCGG 
TTCCTTGATG 
TAAGATGACA 
GACAGCAGAT 
GCGGTAACCG 
ATCTGTCATG 
GGCTTCTGCT 
AATGATTTCT 
TGGAGAATCA 
TGTCACGGCA 
ACGGGCATTT 
ATCAAAGTAG 
GGCCATATCT 
CGCAAAGAGT 
AAGTGCCAAG 
CAGCATTTGC 
GCTGAGGAGA 
AAAGATAATG 
TTGGGCAAAG 
GGGAAAGGCT 
TCGCTCCAAG 



GAAACGGAGG 
AATTCCAACC 
TCCATTGATA 
TAAGTAGAAG 
CAATATTTCC 
CAACATGGTG 
GAAAAAGAGA 
ATTGCCATAT 
TCGTTAATCA 
TAAAGGAATT 
GCCTTGGTAA 
TAAGCCAATA 
TCGATAAAGC 
TTAAACTTAA 
GTTCCCACAT 
GTTTTTTCAG 
CCCTGCTCAC 
GCCTTGGTAT 
ACTGCTGACA 
GGGAAACGCT 
AAATGCGTGA 
GCAAAGGATG 
TGCTTAAAGT 
GTCGGTGCCC 
GGACGACTCG 
TTATCATGTG 
TAGTAATTAT 
GCTTCTTGAA 
GAAATGGCAA 
GCCGAATGGC 
TATCAATTGG AATGTAGGCA TGACCTGACT TAGTCAGCGC TACAAAGGTT GCCAACATTT
CATATTCTTG GCCACCAAAA ACAACCACAG GAGACTTCTC AGGCAAGCCT AGTTGGTCAA 
TGACTGCAGC CAAACTATCC GAATCAGCCT TTAAATCGCC ATAAGTGTGT TCCTGCCCCA 
AAACATTATA GACAGGATAG CTAGGCTGTG TCTGAGCAAA ATGCTCAATG GTTTCAATCA 
TATCTGCTAT TGGTTTATTT GACACAATAG GGATTCTCCT TCAAGTTAAA ATTCATTATA 
GATAAAGCTT CCTTGACCCT GACCAAGATA GCTAAAGAAG TAAAGCAGCC CTAGAAAGAT 
AAGAAAATAC AAGGCTGTCC GACCAAGAAA GAGGTACAAT TCTTTTCTCT GTTTCATCAA 
GAAAAACCAT TCATTTCTGT AATTTTTCGC TAAAATAAGA GTGATTCTTA CTAGCTTATT 
TTTCTACCAT TGTACCACTT TATATAGTAT CTTTTCAATT GTTTACCGTA TGTTTCCAAT 
AGATTTCAGC TTATTTTAAG GATTATACAG TTTTTCTATG TATATTTTCA AATAGAGTGA 
TCCTGCTTCA AAACTCCATT TCAGGAGACA ATGAAGTAAA TCTTCCCATA ATAAAACACA 
CAATATCAAG TTTTTTCAAC ACCTGATACT ATGCGCTTTT CTGATTTTTA AAGACTTTTT 
AACCACTCTC TCATTTAAAA TAATCTCGTC TGATATAAAT TAAAATAGCT TCTATCATCA 
GACAAATGGC TGATAGCCAA AAACTGATGC TAATACCAAA ACTCTCAGTA ATATAGCTCA 
TTAGCAAAAC AAATACTGAA AATGCTAATG TAGAAATCAC TTCAAGAACG GAATAGACAT 
TAACTAAATG ATTTTCCTCT ACTGTTTCCT GAAGAAATAC ACTTTCAGGA ACTTCTTTTA 
GTTGCGATAA CATACCAACT AAAGCTGAAA ATAATAAAAA CATCTGTGCG TTTGGAAAAT 
ATAGAATAGT CAGTGTCACT ATTTCCATAG CTACAAGAGG AAAAAGAATA CTTTCCCCCC 
AAATCATTCA TACCTCTCTC AACTAGATGT AACTTACAAA ACCCCTGACC TCATGAGCCA 
CTTTCTTCCT CCTCATGAGG TCAGTTTTAC TTTCTGCTGT TCCAGTATCG TTTTTCCTCG 
CTAGATTTCC TCAAAAGGGC AGACTCCTCC CTTGGTGCGT CACACGATTT TTTCATCTCG 
ACTGTTCTTT AATGCATCAT TAACGACGCT TTTCTTCTAG GTGGTTCATA AGGAACAGGA 
AGATTCAGGT TGACTTTTCT AATCCTAGAA TAAAGTGCTG AAAACAATTC GGAATAGGCA 
TAGAGACTAG ACAATTTGAG GAGCTGCTTG CGTCCTGTTC GAACACATTT TCCCACCACG 
TGAAGAAAAA GATGGCGGAA GCGTTTGATT GTTAAAGTTT GGAAGTCACC TCCAGCTAGA 
TGTTTGAGAA AAAGATAGAG ATTGTAGGCG ATACAGCTCA TCATCATACG AACTTCGTTT 
TTGATTAAGG TTGAACTATC CGTTTTATCG CCAAAAAATC CCTCCTTCAT CTCCTTGATG 
AAATTCTCGG CTTGACCACG TCCACGATAA AGCTGAAACT GGTCTTGGcT gTTCCACTCG 
TCATATTTGT AACGAGAGAA ATAACATCGT AGAACAAGTA TCCTTCTTTT C 
(2) INFORMATION FOR SEQ ID NO: 168:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 3025 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 168:
CCCCTTTGTC AAAACTGTAA AATTAACGAC TCAACAATTC ATCTTTACAC CAATCTCAAT 
GGAAAACAAA AACAAATTGA CCTCTGTCAA AACTGCTATA AGATTATCAA AACAGATCCT 
AACAATAGCC TCTTCAAAGG TATGACGGAT CTGAACAATC GTGACTTCGA TCCCTTTGGT 
GATTTCTTCA ATGATCTAAA CAATTTCAGA CCTTCTAGCA ATACTCCTCC TATTCCCCCA 
ACCCAATCAG GTGGAGGTTA CGGTGGAAAC GGCGGTTATG GTTCCCAAAA TCGTGGATCT 
GCTCAAACTC CGCCACCTAG CCAAGAAAAA GGCCTGCTGG AAGAATTTGG TATTAATGTA 
ACTGAAATTG CCCGTCGTGG AGACATTGAC CCCGTTATTG GGCGCGACGA TGAGATTATC 
CGTGTCATCG AGATTCTCAA TCGTAGAACC AAGAATAATC CTGTCCTTAT CGGTGAACCT 
GGTGTCGGAA AAACGGCCGT TGTCGAAGGT CTAGCTCAGA AAATTGTCGA TGGCGATGTG 
CCACATAAAC TCCAAGGTAA ACAAGTCATC CGTCTGGATG TGGTTAGCTT AGTTCAAGGA 
ACGGGGATTC GAGGACAATT TGAAGAACGC ATGCAAAAAC TCATGGAAGA AATTCGCAAA 
CGTGAAGACA TCATCCTCTT TATCGATGAA ATCCATGAAA TTGTTGGTGC TGGTTCTGCG 
AGTGATGGTA ATATGGACGC AGGAAATATC CTCAAGCCAG CCCTTGCTCG TGGAGAACTG 
CAACTAGTCG GTGCTACTAC CCTCAATGAA TACCGTATCA TTGAAAAGGA TGCTGCCCTC 
GAGCGTCGTA TGCAGCCTGT TAAAGTCGAT GAACCAACGG TGGACGAAAC AATCACTATT 
CTCAAAGGGA TTCAAAAGAA ATACGAAGAT TACCACCACG TTCAATATAC AGATGCTGCG 
ATTGAAGCAG CTGCAACTCT TTCCAATCGC TACATCCAAG ATCGCTTCTT GCCTGACAAG 
GCCATTGACC TCCTAGATGA AGCTGGTTCT AAGATGAACT TGACCTTGAA TTTTGTGGAT 
CCTAAAGTAA TTGATCAGCG CTTGATTGAG GCTGAAAATC TCAAGTCTCA AGCTACACGA 
GAAGAAGATT TTGAGAAGGC GGCCTACTTC CGCGACCAGA TTGCCAAGTA TAAGGAAATG 
CAAAAGAAAA AGATCACAGA CCAGGATACT CCTAGCATCA GCGAGAAAAC TATTGAGCAC 
ATTATCGAGC AGAAAACCAA TATCCCTGTT GGTGATTTGA AAGAGAAAGA ACAATCTCAA 
CTCATCCATC TAGCCGAAGA TCTCAAGTCT CATGTTATTG GTCAAGATGA TGCAGTCGAT 
AAGATTGCCA AGGCTATTCG CCGTAATCGT GTCGGACTTG GTACCCCTAA CCGCCCAATC 
GGAAGCTTCC TCTTCGTTGG GCCAACTGGT GTCGGTAAGA CAGAACTTTC CAAACAACTG
GCTATCGAAC TTTTTGGTTC TGCTGATAGT ATGATTCGCT TTGATATGAG TGAATACATG
GAAAAACATA GTGTGGCTAA GTTGGTCGGC GCTCCTCCAG GTTATGTTGG CTATGATGAG
GCTGGTCAAT TAACTGAAAA AGTTCGCCAC AATCCATATT CTCTCATCCT TCTCGATGAA
GTGGAAAAAG CTCACCCAGA TGTTATGCAC ATGTTTCTTC AAGTCTTGGA CGATGGTCGT
TTGACAGACG GGCAAGGACG CACCGTTAGC TTCAAGGATG CCATCATTAT CATGACCTCA
AATGCAGGTA CAGGAAAGAC CGAAGCTAGC GTTGGATTTG GTGCTGCTAG AGAAGGACGT
ACCAATTCTG TCCTCGGTGA ACTCGGTAAC TTCTTTAGCC CAGAGTTTAT GAACCGTTTT
GATGGCATTA TCGAATTTAA GGCTCTCAGC AAGGATAACC TCCTTCAGAT TGTCGAGCTC
ATGCTAGCAG ATGTTAACAA GCGCCTCTCT AGCAACAACA TTCGTTTGGA TGTAACTGAT
AAGGTCAAGG AAAAGTTGGT TGACCTAGGT TATGATCCAA AAATGGGAGC ACGCCCAcTT
CGTCGGACTA TTCAAGACTA TATTGAGGAC ACAATCACTG ACTACTACCT TGAAAATCCA
AGCGAAAAAG ATCTCAAAGC AGTTATGACT AGCAAGGGAA ACATTCAGAT TAAATCTGCC
AAAAAAGCTG AAGTTAAAAG TTCTGAAAAA GAAAAATAAA TCCTATAAAA AAGGAGTAGA
AAATGAAATT TTTCTGCTTC TTTTTTTACT AAAATAACTG TAATTTCTTG ACAGCTTGCC
CTTTGTCCAT TATGATATAT AGTAGACTGA ATCTGAAATA GTACGAAACA ATTGCTAAAA
CATTTATAGA AATTAATTTT ACTTTCCCAA TCGATTTGTT CTCATCTTAT TTCAATCTGC
TATAGTCAAT TGAAACAAGA ACAAGACAAA AGAGCCTCAT AAAAGGTATT GCAACTTGGT
AATACCTTTT TGAGGTGCTT TTTGATATGA GCCCATGTTT TCTCAATAGG ATTGTACTCA
GGTGAGTAGG GAGGAAGAGG TAAAAGTTTA TACCCAAACT CTTCACACAA GAGTTCTAAC
TTACCCATTC TATGGAATCT TGCATTATCC ATAATAATAA CCGATGGTGT GGTTAATGTT
GGTAAGAGAA ACTTCTGAAA CCAAGCTTCA AAAAAGTCGC TCGTCATCGT CTCTTCGTAA
GTCATTGGAG CGATTAACTC ACCATTCATT TGTTAGACCT GCAACCAAAG AAATTCTCTG
ATATCTTCTT CCAGATACTT TGCCTCTTCT TAACTGACCT TTTAATGAGC GACCATATTC
TCGATAAAAA TAAGTATCGA ATCCTGTTTC GTCAATCTAA ACAGGTGCTA GGTGCTTTAA
ACTATTAAAA TTCTTAAGAA ATAAGGCTAC TTTTTCTGGG TCTTGTTCAT AGTAGGTGTA
GTTCTTTTTT TTTTCGAGTG TAGCC

(2) INFORMATION FOR SEQ ID NO: 169:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 4104 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 169:






TTTAAGGTTT TAAAAAAAGT TTTCGAAAGG TTTCTTCTTT [illegible] [illegible] 60
CGTTGATATC TAAATCGTGG TCAAAGCCGG CAATTTTTCC [illegible] [illegible] 120
TATCATAATC TAAATCAGTT TTAGGACTGC TCTCCAAAAA [illegible] [illegible] 180
CGGAATCCAA ACAGAGGTAA ACTTGCCTGT ATCAATACTG [illegible] IXjAAOTAGAC 240
ACCAACGTAG ATGCCGATGT TTTTAGCACC CAGTGATGCT AGTTTTGCTC GAAAGTTTTC 300
GACACCTTCG TTCATATTAG ACATGGTTTT GTCTTCCACG TCAAGCCAAT AGTAACTAGG 360
GCTGTAAGGA GAGGCAGCAT TGTAGAAAAC TTCGGCAGCC TTTTCCATTT CTTGGACACT 420
TTTTCCAGCT ACATAAGCGT AGACAGCAAC TGGGACATTC CGCTTTTGAA GTTCAGTGAT 480
ATGACTCTTA TAGGCCTTGT CTATTCCATT GATAAATGAA GCATCATTTT CTTTTGTCGT 540
TTGAGCACCA CTGTGAACAC GAACAATAGC ACCTGAAATA TTTTGTGAGA GGGCATCGTA 600
GTTGATTTCC TCAGGACGCT GCCAGCCAGA [illegible] ATCGGTTTGT CTAAGTGTTT 660
CAAAGCCTGT GCTTCAATCT GTGCTATATT GGATTTTGTT TTAAACGATT GGCTGTCATT 720
AAGTGGGCGA TTGATGATTA AAATGAACAT CATAATCCCA AAAAAACTAA ATAAAATAAG 780
TGGATGAATT TGTTTTCTCA TATCTTATAA TTCTACCCTA AAAATCAAAA AAAATCAAAA 840
AAATGGGTTA AGGAAGAGAC TTTAGAGCAT TTTTTCATTC AAGAGTGCGG AATGATTTGA 900
AATATGGTAT AATAAAAGGG AATTTCTACA GAAAAGAGAA GATTATGTCA AATTTTGCCA 960
TTATTTTAGC AGCGGGTAAA GGGACTCGCA TGAAATCTGA TTTGCCAAAA GTTTTGCACA 1020
AGGTTGCGGG TATTTCTATG TTGGAACATG TTTTCCGTAG TGTGGGAGCT ATCCAACCTG 1080
AAAAGACAGT AACAGTTGTA GGACACAAGG CAGAATTGGT TGAGGAGGTC TTGGCTGGAC 1140
AGACAGAATT TGTGACTCAA TCTGAACAGT TGGGAACTGG TCATGCAGTT ATGATGACAG 1200
AGCCTATCTT AGAAGGTTTG TCAGGACACA CCTTGGTCAT TGCAGGAGAT ACTCCTTTAA 1260
TCACTGGTGA AAGCTTGAAA AACTTGATTG ATTTCCATAT CAATCATAAA AATGTGGCCA 1320
CTATCTTGAC TGCTGAAACG GATAATCCTT TTGGTTATGG ACGAATTGTT CGTAATGACA 1380
ATGCTGAGGT TCTTCGTATT GTTGAGCAGA AGGATGCTAC AGATTTTGAA AAGCAAATCA 1440
AGGAAATCAA CACTGGAACA TACGTCTTTG ACAACGAGCG TTTGTTTGAG GCTTTGAAAA 1500
ATATCAATAC CAATAACGCT CAAGGCGAAT ACTATATTAC AGACGTCATT GGTATTTTCC 1560
GTGAAACTGG TGAAAAAGTT GGCGCTTATA CTTTGAAAGA TTTTGATGAA AGTCTTGGGG 1620
TAAATGACCG TGTGGCGCTT GCGACAGCTG AGTCAGTTAT GCGTCGTCGC ATCAATCATA 1680
AACACATGGT CAACGGTGTT AGCTTTGTCA ATCCAGAAGC AACTTATATC GATATTGATG 1740
TTGAGATTGC TTCGGAAGTT CAAATCGAAG CCAATGTTAC CTTGAAAGGG CAAACGAAAA 1800
TTGGTGCTGA GACTGTTTTG ACAAACGGTA CTTATGTAGT GGACAGCACT ATCGGAGCAG 1860
GAGCGGTCAT TACCAATTCT ATGATTGAGG AAAGTAGTGT TGCAGACGGT GTGATAGTCG 1920
GTCCTTATGC TCACATTCGT CCAAATTCAA GTCTGGGTGC CCAAGTTCAT ATTGGTAACT 1980
TTGTTGAGGT GAAAGGATCT TCAATCGGTG AGAATACCAA GGCTGGTCAT TTGACTTATA 2040
TCGGAAACTG TGAAGTGGGA AGCAACGTTA ATTTCGGTGC TGGAACTATT ACAGTCAACT 2100
ATGACGGCAA AAACAAATAC AAGACAGTCA TTGGAAACAA TGTCTTTGTT GGTTCAAATT 2160
CAACCATTAT TGCACCAGTA GAACTTGGTG ACAATTCCCT CGTTGGTGCT GGTTCAACTA 2220
TTACTAAAGA CGTGCCAGCA GATGCTATTG CTATTGGTCG CGGTCGTCAG ATCAATAAAG 2280
ACGAATATGC AACACGTCTT CCTCATCATC CTAAGAACCA GTAGGAGCCT ATCATGGAGT 2340
TTGAAGAAAA AACGCTTAGC CGAAAAGAAA TCTATCAAGG ACCAATATTT AAACTGGTCC 2400
AAGATCAGGT TGAATTACCA GAAGGCAAGG GAACTGCCCA ACGGGATTTG ATTTTCCACA 2460
ATGGGGCTGT CTGTGTTTTA GCAGTAACGG ATGAACAAAA ACTTATCTTG GTCAAGCAGT 2520
ACCGCAAAGC TATCGAGGCT GTCTCTTACG AAATTCCAGC CGGAAAATTG GAAGTAGGAG 2580
AAAACACAGC CCCTGTGGCA GCTGCCCTTC GTGAATTAGA GGAAGAAACA GCCTATACAG 2640
GGAAATTAGA ACTCTTGTAC GATTTTTATT CAGCTATTGG CTTTTGTAAT GAGAAGTTAA 2700
AACTATATTT AGCAAGCGAT TTGACAAAAG TGGAAAATCC GCGTCCGCAG GATGAGGATG 2760
AAACCTTGGA AGTCCTTGAA GTGAGCTTAG AAGAAGCGAA AGAATTAATC CAATCAGGTC 2820
ATATCTGTGA TGCCAAGACA ATTATGGCTG TTCAGTATTG GGAGTTGCAG AAAAAATAGA 2880
GGAGGTCAGT ATGGGTAAAT CTTTATTAAC GGATGAAATG ATTGAAAGAG CTAATAGAGG 2940
CGAAAAAATT TCAGGTCCTC CTTTGCTAGA TGATAATGAG GAAACTAAGA TTTTACCAAC 3000
CTCTTCTTCC CGTTTTGGTT ATGCCAATCC TAAGGATCAT GGTTTTAGCC AGGAAACCTT 3060
GAAGATTCAG GTCGAACCAT CTATTCATAA AAGCCGTCGT ATTGAAAATA CCAAGAGAAA 3120
TGTCTTCAAT TCTAAGTTGA ATAAAATCTT ATTTGCGGTC ATCTTTCTCT TGATTTTGCT 3180
TGTTTTAGCA ATGAAACTTT TGTAATAGAA AAGGAATTGA AATGAAAATA GGAATTATTG 3240
CTGCTATGCC AGAAGAACTG GCTTATCTGG TCCAGCATTT AGATAATGCC CAGGAGCAAG 3300
TTGTTTTTGG GAATACCTAT CATACAGGAA CCA^C TCATGAAGTC GTTCTTGTAG 3360
AAAGTGGAAT TGGTAAGGTC ATGTCTGCTA TGAGTGTGGC GATTTTGGCT GATCATTTCC 3420
AGGTGGATGC CCTTATTAAT ACGGGTTCAG CTGGGGCAGT AGCAGAAGGT ATCGCTGTTG 3480
GGGATGTCGT GATTGCTGAC AAATTAGCCT ATCATGACGT GGATGTCACA GCTTTTGGCT 3540
ATGCTTATGG ACAAATGGCG CAACAACCGC TTTATTTCGA ATCAGACAAA ACCTTTG™ 3600
CTCAAATCCA AAAGAGTTTA TCTCAATTGG ACCAAAACTG GCATCTTGGT TTGATTGCTA 3660
CAGGAGATAG TTTTGTTGCA GGAAATGACA AGATAGAAGC GATTAAGTCC CAT^TCCCAG 3720
AAGTTTTAGC CGTGGAGATG GAGGGGGCAG CTATTGCTCA AGCAGCGCAT GCCCTCAATC 3780
TCCCAGTCTT AGTCATCCGA GCTATGAGTG ACAATGCCAA CCATGAAGCA AACATCTTTT 3840
TTGATGAGTT TATTATCGAA GCTGGACGTC GCTCTGCCCA AGTCTTGTTG ACCTTTTTGA 3900
AGGCTTTAGA TTAAGCGGAA ATTTGACAGT TTTTCTAGCT TATGATAAGA TTTAAGTAAA 3960
GAAAAGCTAG AAAACGTTTC AGAGGATATT ATGAGTATTG AAATGACCGT CAGTGAGATT 4020
GCAGAGGTCT TAGGATTATC TCGCCAAGCA ATCAATAACC GTGTCAAAGA ATTACCAGAA 4080
GAAGACACAG ATAAAAATGA CAAG 4104

(2) INFORMATION FOR SEQ ID NO: 170:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 8876 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 170:
CACGGATAGG CTCGGCTTTC ATCAGTCCTC AGGCTGATTT ACTAATAGCA AC^CCTCG 60
ACAAAGTCCA CAGCGATACG TnTGGGTATC AATCCTACGC TTACGCTGAT ACCTTTGCTG 120
GCAGGATTGG CAACGATAGA GCTTGATTGG CTTGGAGTTA CTATTGGGCA AGGATGGTAC 180
AAACCGTAAT CCATCCACTG CTTTCAACAG TTCCTTAAAA TCCCGATCCT TGTGTTGATA 240
GCCT^CCT TGAAAATAGA GGTGATAATG ACAGAGTTCA TGTCGGACAA TTTTCCTAAA 300
AACGTCCAAC CCCAGTTCCT GATAAACCTT GGGATTAAAA TCCAAATGCC CATCTTTGGG 360
GAAAAATCGC CCACCTGTCG AACGTAGACG CCTATTCCAC TGGACATGAT GGATAAAAGG 420
TCTGCCGAAG TCTTCTAGTG AAACCTGCTT GACGTAATCA GTCAGTTTCA TTTGGAGCTA 480
GGAGAGACAG ATTAACTTTT TCACGTTCAG TATCAATTTT CTTAACCCAA ACGCTCACCA 540
AATCTCCAAC TGCCACCACT TGACTAGGGT GTTTGATAAA CTTGCGACTC ATATGGGAAA 600



WO 98/18931 



PCT/US97/19588 



TATGGATGAG 
TACGCACCAC 
GGCGAaCACA 
AATGATATCT 
AAGCGACTTG 
GAGTTCCTTA 
ACTTTCAGGG 
AACTTTCTTG 
ATTTTCAGAG 
ATTGACATTG 
TAGTTTCTTC 
TTTGACCAAT 
TTCAACGGTC 
ACCACTTTCA 
AAAAGCTTCA 
TTGACCAATT 
TTTAACAGGA 
CTTGGCACCT 
AACCAAGAGG 
CTCAGTTAAT 
TTGCTGAACA 
AAGAATACGG 
ATTGAGAGCC 
AATCTGAAAA 
GTGTCTCAGC 
GACCAAAATA 
GAACTTTTCA 
AAAGAGTCCA 
ATAGAGTTCT 



ACCGTCCTCA 
TCCTTCTAGC 
GGTGCGTCAA 
TTAAGAGTTT 
AGTTTGCTTT 
ACTGCAGTGT 
ATACGAAGGA 
ATTTGGGCGC 
ATAGTTTTGT 
ACACCAACTT 
TGACTGACAT 
TCCGCAAGAG 
AAGTCTGGAA 
TTAACGATAA 
CTTTCACGAC 
AAATCTGCTA 
TAAATAACCT 
GTACGAAAGG 
AGATTGCGCA 
TCTGTCCGAA 
ACTTCATCAA 
TCCGTCGCAT 
AAGGTACGAT 
ACCTGCTTTT 
ACTTCCTGAT 
TCAACTGCAC 
GCTTCTTTCT 
GCTTCACGGG 
TCAACGTCTG 



TGAATTCCGA 
TTTTGTCCAA 
AGGAATCACG 
CTGGACCAAG 
GGGCTTCTTC 
AATTCTCTGG 
AACCAGCAGC 
GTGAAGTGAT 
TGAGTCCAGC 
GGTTAACCAC 
CGTGTTGGTA 
GATCTTGCAA 
ACTCCTGACG 
CATAGCTGAC 
TGGCCGTTCC 
AATCTTTCTT 
GAGTTGTCAG 
CTGGGTCAAA 
GATTGTCAGA 
TACGACGCTC 
TATAAGCATT 
GTTCAAAACC 
AGCCTTGCAT 
CATCAAGACT 
AAGTCATAGA 
CGGTCAAGGC 
CTAAGTCAAC 
CAATGGTTGC 
CTAATTTTTC 



TATCAACAAA 
CCACTAAGTC 
GAAATCTCGA 
GTCTAACTCT 
GTTTAGGTCT 
GTGAACTCCT 
CTGCTCAAAG 
TTTTCCTTCT 
TACGTGTGAA 
TGTATCGACA 
TTGACCGACA 
ACGACGGGCG 
AGCAAGTTCG 
TTCAGGGAAA 
ATTTCCAATG 
GGCTTCTTCG 
CATTTTTCCT 
TCCAAGAACC 
AAAAAGTTGG 
GATAGCAGGC 
TTTCACCTTG 
GATCTTCAAG 
AGTTCCAACT 
TTCATCCTTG 
ACGCAAGGTC 
TTCCTTGCCA 
TATATTCTGC 
CTTGGTACGA 
GGCAACTAAG 



AGCACCGAAA 
CTTGATATCT 
CCTGGTTTGA 
TGCGCCATTT 
TTAATATCTA 
GTATTATCAA 
GCCTTGGCTC 
TCCTCGCGGT 
AGAAGAGCTG 
ACAAAGTCCA 
CCAATTGACT 
ATAGAAATGG 
CTGGCAGAAT 
TCTTTCAGAA 
GCAATAATCT 
ATTTGACGAG 
GTTGCATCCA 
ACGCGCCCTT 
ATAGCTCCTT 
AAGACCTTTT 
AAACGAGTAG 
ACACCAAGTT 
GTCTCTGAAA 
GCTTGAGAAG 
ACATCTTCCG 
GTCGCAAATC 
AAAATCAAGC 
CGCTTTTCCT 
ATAGCTTCTT 



TCAACAACGT 
AGGACATCTT 
GAAGATCTGC 
CCTTGACTGA 
AACGTTTGAA 
GGATATTGCT 
CCAGACGAGG 
ATTTGACAAT 
GGCTAGCTGT 
GACTCTCAGA 
TAGGATCGAT 
CAGAGCGTTT 
AGACAGAAGC 
CTTCCGCTAC 
CTACACCGTA 
CTGATGCTGG 
CGACAGCTAG 
TCAGTGGAGC 
CTTCAGCTTT 
TCTTAACGGA 
CAAAGAAGGC 
TCTCCCCACG 
AATCATAATA 
TAAGTTTAGA 
ATAAGGCTTC 
CTTCACAGAC 
GAGCAAGAGG 
TATAAGGAAG 
CCAATTCCTT 



660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
GGTCAACTTA
TGTCAGACTT
CATGTCCTTG 
AACGGTATCA 
ATCCATAAAT 
AAATATGTAG 
CCACCTTTTT 
AGTTCTGGAA 
CGATTGACTA 
TCACTAGCTG 
TTTCTTTTTT 
TCTATATCTC 
TCCTCTTCGA 
TAATCATCAG 
TTATAAATTA 
TCAAGAAAAG 
TCTAAATATT 
TGTTTTTTCG 
TCATAATTTT 
TTATAGATAT 
CAAATAAGGT 
TTTCCAAATA 
AAAACTTGAA 
TAAAATACTC 
TTTATCGCAT 
AACGTAATAT 
ATCTTAATAT 
TGATTTACAA 
TCTAATAAAC 
TTGACTACCT 



CCTTGTTCTT 
TTATCCAAAT 
CGATAACGCG 
ATTTGCTTTA 
CTATTATACC 
GAAATAGATT 
TTAACCAAGC 
GATGTTCCTG 
AAATTAATTT 
AATCTTTATT 
CCCACTTAGT 
CTAAGTGCCC 
AAACAAGTTC 
GAAGAATAAA 
ATCCTCCAAC 
GGAAAATTCC 
CACCGTCATT 
CTCTTTCTAT 
CTAATATGAT 
ATGTCGGGAA 
TGTTTATATC 
ATGAGCGATA 
CATTCGTTAA 
CATCAACCAA 
GATACATCTT 
AATCTCCTGT 
TAAATGATAG 
TAATAACTTC 
TATTTTTATC 
CCCATAATTT 



GAATCTTAGC 
CAATAATAGC 
CGATAAAGGG 
ACGTCACTCC 
ACAAGCTAAA 
TATATGCTAC 
CATGATATCA 
ACCTGGAATA 
ATTAGAATAA 
ATCAAATTTA 
TCGTGCTTCT 
CAAAGGATAA 
TTGTTCCATA 
TAAACCAACA 
ACAATTATTA 
TTTCTCTGCA 
TTTTAACCAA 
ATCATAGTTT 
TTTGTAGGAG 
ATTAATATAG 
AAATTGATTT 
TTGTTTTCTA 
ATTTTCTGTC 
ATCAGCAAAA 
TTCAAATGTC 
AATCATATCA 
ATTCATCTGT 
TATATCTTTT 
TCCTTGATGT 
TCTGATAATG 



TAAGACAGCT 



CTTAATCGCC 
AATAGTCGCC 
CAAATCCTGA 
CGTTTCAAAT 
AGCGCAATAA 
AAAGTATTTA 
ACACATTGAC 
GGAAGATTAT 
AAATAAAGAT 
TCAATACTAG 
ACTTCATGAG 
TAATAACGAA 
AAAGGTGTTC 
CTTATAATCG 
GCTATTAACT 
GCACTAAAAT 
TCTAAGACGG 
TCTTTTAGAG 
GTTGCAGTTT 
ATTTTTCGTA 
ATTCGATGAT 
AACCAATTAT 
TGACCAAGAA 
CAATCAAATA 
GACAACTCAG 
TGGCTAATGG 
AATGTTTGTC 
AACAAAACAA 
ATTTTCTTTT 



TCCTTACGGT 
ACCTCATCCA 
CCTTCAGCTG 
GAGATTTTTT 
TAACTCGTAG 
CTTGCACTTA 
ATGGATCAGA 
TTTTCAAATT 
CCATCTTATT 
TATTCCAATT 
AATAATGTAG 
TCCAGCTCGG 
AATGCTTTGT 
TATATTGAAA 
TAAAATCTAA 
TATGATAAAC 
TTGCCAATTC 
CGCAATCTTT 
GTTTAGCATC 
TAGAGTGAAT 
AAAGCTTACT 
CTGTATCATC 
CCCCCCAAAA 
CAACATCAGA 
ATGAATCATT 
CAAAAGAATT 
AAGCTATCTC 
TCTCCACTAT 
CACTAATTGA 
TATTTAATTA 



CATTGAGATT 
GACTACCAGT 
TCAAACTTAG 
CATATTTTTT 
AACATTTAAA 
AAGAGCATTG 
CATAATAGCC 
TTTATATGGA 
TAAAATTTCT 
TATGCGTTTT 
AAAATGAATA 
TGAAATAAGT 
AAGTTTATAA 
ACCAAGCTGT 
TCTATCAAGC 
AATATCAGAA 
TTGAATATAT 
GATTCTATTT 
TATAACAGGT 
ATAAAGTCTC 
ATTGAATAAT 
CATCTTTTGT 
AGGATAAAAG
ATCGGATAAT 
TGAAGATAGA 
CTCATCTATA 
CTCTGTAGAT 
TGACAAAGAC 
GTAAGTCAGT 
TAGCACAATT 



2400 

2460 

2520 

2580 

2640 

2700 

2760 

2820 

2880 

2940 
3060

3120 
ATGATATATA TCAGGTAATA TCAAGCTATA TTATCTCTTA GCTACTCAAT TTGAAATTTT 4200

AACTTTTCCC TTTTCCGCAA AATAATAGTA TAATAGAGGT AGAATCTAGA ATCGAGGTAC 4260 

ACCTATGGCT GTCAAATTTA CAAAACGAGA CGACTTGGAC AAGATGTTTG AAGAGTTTGC 4320 

TAAACTCCCT GATTTGAAAC AAGTTACTTT CCCTGATGAC AAAGAGAAAA AAGTCAAAGC 4380 

AGAAAAGAAA AACTAGATGA CTGCTTTTCA ACAACTCCCA TCTAGTGTAC TTCAAACTGG 4440 

AGCCATTTTT CTCTCCATTA TCATTGAAGC CCTTCCCTTC GTTCTGATAG GAAGCATTGT 4500

CTCAGGGCTG ATTGAAGTTT ATATCACACC TGACAAGGTT TATCATTTTC TCCCTCGAAA 4560 

TCGTTGGGGG AGAATCTTTT TTGGGACCTT TGTCGGTATA CTTTTCCCTT CTTGTGAATG 4620

TGGAATCGTC CCCATCATCA ATCGTTTTCT GGAAAAAAAG GTTCCAAGTT ACACGGCCGT 4680 

TCCTTTTCTT GTGACAGCAC CTGTTATCAA TCCCATTGTT CTTTTTGCGA CCTATTCTGC 4740 

CTTTGGCAAC TCCTTCCATG TCGCCCTATT ACGAGCTCTG GGTTCCATTC TTGTGGCTGT 4800 

AATACTAGGA ATTTTTCTAG GATTTTTCTG GCAAGAACCG ATTCAGAAAG AAAATCGTCT 4860 

GGCTTGTCAT GAGCATGATT TTTCTTACTT GAGTTCTGCA AAAAAAGTTT TTCAAGTCTT 4920 

TGTGCAGGCC ATTGATGAAT TTTTTGATAC GGGGCGTTAT TTGGTATTTG GCTGCCTCTT 4980 

TGCTTCTATA ATACAGGTCT ACGTTCCGAC TCGGATTCTG ACCTCTATCA GTGCGACCCC 5040 

TCTTTTTGCC ATCCTGCTCT TGATGATTTT AGCCTTTCTT CTTTCGCTCT GTAGTGAGGC 5100 

GGATGCCTTT ATAGGTGCTT CTCTTCTCTC GAGTTTCGGT TTGGCACCAG TTCTGGCCTT 5160 

TCTCGTCATT GGTCCAATGC TGGATATCAA AAATATTCTC ATGATGAAAA ATTACTTGAA 5220 

AGCACGATTT ATCAGTCACT TCATAACAAT TGTAACTCTT GTCGTCTTAG TCTATTCTCT 5280 

CTTGATTGGA GTTATCCTAT GATTCGATTT TTAGTTTTAG CTGGCTATTT TGAACTGACT 5340 

ATTTACCTCC ATCTGTCGGG CAAACTAAAC CAGTACATCA ACATGCACTA TTCCTATCTG 5400 

GCCTATATCT CCATGGTGCT TTCTTTTATC TTGGCTATCG TTCAATTGTA TATCTGGATG 5460 

AAGCAAGTCA AAACCCACAG TCATCTGAAC AGCCGATTAG CCAAGATAAC GAGTATTTCT 5520 

CTTCTGGCTA TTCCACTTGT CATCGGCTTA ACTTTCCCAA CTGTTAGCTT GGATTCTCAG 5580 

ACTGTTTCTG CTAAAGGTTA TCATTTCCCC CTATCGGAAG GAACGGATCT AGCCATTCAG 5640 

ACAAGCGAAG GGACGACAAG CCAATATTTG AAACCAGATA CCAGTTCTTA TTTTTCAAAA 5700 

TCAGCCTATG AAAAGGAAAT GCGAACGGCG GCGGATAAAT ACTTATCCCA AGATAGTATT 5760 

CAGATCACTA ATGAAAACTA TATGGAAGTC ATGGAGGCTA TCTACGACTA TCCAGATGAG 5820 

TTTGAGGGCA AGACAATCCA GTTTACAGGC TTTGTCTATA ACGACCCCAG TCATGCCAAT 5880 
AGTCAATTTC
TTGCTGACCA 
AAACTGGTCA 
AGCTTTACCA 
CAAGATAAAA 
TTATATGAAA 
GCTCATAGAA 
GTATAAAGCT 
TTTAGGGCTT 
TCTCCCACTT 
TTTCATAATT 
TTACAGCTTT 
GAACATCTAA 
TATCCAAAGG 
TTACTTCTTC 
TCAGACAGAT 
CATCTCGAAG 
GAAAATCCTT 
TCTACTTAAT 
ATTCTTCGCG 
GGTATTCAAT 
GGATTCGCTT 
CACTAATATT 
CTTTTAATTC 
GATGAAACTA 
ACTTCCTGCG 
TAACTGATGC 
TAGCTGTATT 
TAAGCCTAGA 
AAGAAATTGC 



TGTTCCGATT 
AGGGCAATAC 
ATCACTACCA 
AAGTCGATAA 
ACGAACAAGT 
ATTAGTGACT 
AATAAGOGAG 
CATGGCCAAA 
CCTCTCTCCA 
CTCTCTTAAA 
TATAGCCAAC 
CTCCATATTT 
GTCCGAAACA 
TAAAATCTTA 
AAATTCCGAT 
AGATTCTTCC 
ATGCTTTTCA 
GCTATGATAG 
ACTATGCTTA 
TTTTTTTTCT 
TGAAATGCGT 
AATCAAGTCT 
AGACATTGTA 
CATTCGATTA 
CAAAGAATAA 
AAACAAAATA 
ACGATTTAAA 
AAAAACAGCT 
AGACCTTCTT 
GGCTGATTTT 



CGGCATTATC 
CCGGCAGTAT 
TAAAGAACTC 
ACCAGAAAAT 
TCTCTTCTGA 
TGTAGATTTT 
CCACTCATTC 
GTTTTCTAAA 
TGTAGCTTCA 
AGCTTGTATT 
TCATATCTAT 
TCTGACCAAT 
ATTTGAGATT 
TCTAAATCTA 
GCGAAATAAC 
TCTACAATTT 
TATGGATTTA 
GCATTGCTTC 
TTCATCTTTT 
GGAATCTCAC 
TTAATCGTAC 
TGTGGGACTA 
AGCCTTTTCT 
GTCCTCCTAT 
ACTTTAAGAA 
TGGTATAGTA 
CGTCTTGTTG 
TATCAACTTA 
ATGGCCACTC 
GGTATTCACG 



CACTGTATCG 



GAAAACAACA 
AAACAAAACC 
CCCTATGTAT 
ATAACAGAAA 
CATCTTATAC 
ATTAGACTAG 
AAAATAGTAT 
TTAGGATAGC 
TTTCTCCGTA 
TATACTCAAC 
GCTTTGCTTC 
TGATATAATT 
GATAGCCACC 
GAGCTAAATC 
CATTTTTAAA 
GAAAATAGAC 
CCAAACCGCC 
GTTCAAAGAT 
CACTTGTTTG 
GTGTTGTAGC 
GCGCCAGAAT 
TAATCATTTC 
TTTCTCTAAG 
ATCTTCTCAC 
GTTCTATGAA 
GTGTTCAGCG 
AACACGCAAA 
TTCAATATGT 
AAAGCAACTT 



CAGATTCTGG 
CTTGGATAAC 
TTCCAACCTT 
ATAGAGCTTT 
AAGAGCCTGT 
CATTCCCAGC 
CGATTTCTTT 
CAAAATAGTC 
GAGGACTAAT 
GcGGAGTATC 
ATTCCAGTGA 
AGATTTTTCT 
TTTAGTTTCC 
ATCCAAAAGA 
TCCTCCAAGA 
CCATGATTTC 
CTGCGAATCT 
AATAAAATAT 
AGTTGTGATA 
AGCTACAGCG 
ATGTTTTCTG 
CTTCTCAGTA 
CTGTTCTTTT 
TTAAATTATG 
TGATAAGATT 
TTATGAAGCA 
CACGACTTTT 
AGGTGGACGA 
GCGAGAATAC 
AATCCGTCGG 



TGTCTATGGA 
AGCCAAAGGA 
GGAAATCGAC 
TTAAGAAAAT 
TCGTTTTTTG 
AATACAAGTA 
AGGTGCTTGA 
TGGCAATTCT 
AAACAAGGTA 
GCTTCTATAT 
TAAGACTGTC 
TTAGAAGTAA 
TCTAACTCTG 
ATCAGTTTCT 
GAATGGCCTA 
AATTCTGTTT 
AGTTCTTGAA 
TTTTTCATTC 
ATCTGACGCA 
TAGAGTTCAG 
ACAAAAAACG 
GTTTCTTTGT 
TCTGTAAAAT 
TACTAATACA 
TTAGCATTAG 
AGTAAACAAC 
GAAGAGATAT 
AAACCTAAAT 
CGCACTTATG 
AGCCAATGGG 



5940 

6000 

6060 

6120 

6180 

6240 

6300 

6360 
6600

6660 
TTTAAGTAAC TCTTGTTCAA AGTGGTGTTA CGATTTCAAG AACTCCTCTC AGTTCTGAGG 7740
ACACGGTAAT GATTGATAGC CATTCCCATC AATATCGTAT CTTTGGACAT AGCCAATAAA 7800
TGTTTCATTT TTGCGTGGTT TCTGGCTATT AACGATTGAA ATAACCCACC AACTTATCAA 7860
AAATAGAAAT AAAAATCCTA AGATTACTGT CATATCATAA CACTATTAAA GTTTAACCCA 7920
CTTATCATTA TCCATGATAA AAGGCTTAGC CAGTCCCTCG CCTGTATAAT CCGCATACTT 7980
GGTGCCCAAA TACTTGTAGC AATCTTCCTT ACTAGCAAAT TTAATCGCTT GGTAGGGCTC 8040
TTCGAAAGTC AATTTCTCTA CAAATAAGAA ACCGTCATCA GCAGGTACTA AGACCCCAAC 8100
GTGGCCTACA AACAGATACT CGCCATCCAA ATTGTCGTGC AAGACTACAG ACAGCATTCG 8160
AGCTTTTTCA TTGAATTGAA ATTGTGAGAA GAATGCTTCC ATCTTTTCAG CGTGAACCTT 8220
GACATCTGTA GTTGACTCAG TTGGAACTCT CGAAAATAGA ATATCAAACT CTTCCTTATC 8280
TTGTGAATCA AAGACCTTTC CTTTATCAAT CGCATCATTA TCTAGGAAAA GCAACTGGTC 8340
ATTCTTTTCA AGCTTTGGAA TGGTGACTGA ATTTTTCAAA AGACAATAAC TATTGATACG 8400
GCAGTTGGTC CCAACAAAAT CGCCCTTCTT TTGATTCCAG AGATGACTGA TTTTCTCAAC 8460
ATCGTATTCG GTGTGAGTAA AGGAAGTGAA ATCTCCTGAT AAGCCAGTTG AGCCGACAAT 8520
GGTATTATAG TCATTAACGA GATTAAAAAA TGCATCAACA CTATTTGGAT CCAAGTGAGC 8580
TGATAAGAGA GATTTGACCT CTTCTGTACT TACCTGGTTG TTTAGGTTGG TGTATGAAGC 8640
TTTCCATGGA ACTTTCGCTG AACTGCTTTG CCTTTGATTC GTCCCCTCAG AAGTAGCATG 8700
TTGTTGTTGA CAAGCAGCCA AGCCTAAAAA CAAGGCTGAA CAGATTCCTA ATGTGGCTAA 8760
TTTTCTTGAT TTCTTCATTT CTTTCTCCTA AATGTCTTGG ATTAAAGTTT CTTTAACTAT 8820
TGCTTTACAG ATATTGATTA CTTTCTCATT TAATGTGTTC ATCGTCTTTC CTCCGG 8876

(2) INFORMATION FOR SEQ ID NO: 171:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 14736 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 171:
CGCAAACTTT CGCGGTCGGA AGGTAGTTTT ATGACACGAT TTGAGATACG AGATGATTTC 60
TATCTCGATG GAAAATCATT TAAGATTTTA TCTGGTGCCA TTCATTATTT TAGGGTTCCT 120
CCAGAGGATT GGTATCATTC GCTCTATAAC TTGAAGGCTC TTGGTTTTAA TACGGTAGAG 180
ACTTATGTTG
CTGGATTTAG 
CCGTCTCCAT 
AAGAACATGC 
GATCAGTTAT 
ATGCAGGTTG 
CGACAGCTAA 
CGAGCTACTC 
GGTTCTAAGG 
AAATGGCCAC 
ATTATCACAC 
TCTATCAATC 
GCTCGAGGAA 
GAAGAAGGAA 
TCAGAGTATC 
CTAGTTGAAA 
CTCTATCCTC 
GAAACAAACT 
CAGCTGTATG 
ATTTTTTATC 
GGGCGTGTCA 
GGGGTCTGTA 
AATCCTGAGA 
TATGACTTTA 
GGGGTTGCCT 
TCACTTTATA 
ACAGAAGGTC 
AAGGGGGAAA 
AAGTAGCCAA 
AAGTTGTCAA 



CTTGGAATTT 
AGAAATTTCT 
TTATCTGTGC 
GAATTCGCTC 
TGCCAAGACT 
AAAATGAGTA 
TGGAAGAGTG 
TGAAAGCTGG 
CACCTTACAA 
TCATGTGTAT 
GGGATCCTAA 
TTTACATGTT 
CTTTGGACCT 
ATCCAACTGC 
CGCAGTTGGA 
AAGTTTCTTT 
AAAAGATGGA 
GGGATGCAGA 
TCGATGGTCA 
AAGGTAAAAA 
ACTATGGGCA 
AGGATCTGCA 
AAATTGATTT 
CAGTCGAAGA 
TTGTCAATGG 
TCCCTCATAG 
AATATAAAGA 
ACTTATGACA 
TCTTTGGGCT 
CAACGATATT 



ACACGAGCCT 
CCAAATAGCG 
GGAATGGGAA 
ATCCGACCCA 
GGTGCCTCGT 
TGGTTCTTAC 
TGGCGTAACC 
AACCTTAATT 
CTTTTCGCAG 
GGAGTTCTGG 
GGAATTGGCA 
CCACGGTGGT 
GCCACAAGTT 
TAAATATCTT 
ACCACTCTAC 
GTTTGAAACC 
GGAGCTGGGA 
AGAAGAAAGA 
GTGGGTTAAA 
GAAAGGGCTA 
TAAGTTCTTA 
TTTCTTACTA 
TTCAAAAGGA 
GCCAAAAGAT 
GCAGAATCTA 
CTATCTCAAG 
AGAGATTCAT 
ATTGTAGGAT 
GGAAAACTAA 
GAAAAGAGTG 



TGTGAAGGTG 



CAGGATTTGG 
TTCGGTGGCT 
GCATATATCG 
TTGTTGGACA 
GGAGAAGATA 
TGTCCCCTCT 
GAAGAGGACC 
ATGCAGGAAT 
GATGGTTGGT 
GATGCAGTTC 
ACAAACTTTG 
ACGTCTTATG 
GCAGTCAAGA 
AAAGAGAGTA 
TTAGATAGCT 
CAAAGTTATG 
CTTCGTATCA 
ACTCAATATC 
TCTAGGTTAG 
GCGGATACGC 
AACTGGAAAC 
TGGACTCAAG 
ACTTACCTAG 
GGACGTTTTT 
GAAGGTGCCA 
TTAACTCGTA 
GCCGTATTGA 
ATGTTTCACG 
GTTTGAAACT 



AGTTTCATTT 
GTCTCTACGC 
TACCAGCTTG 
AGGCAGTTGG 
ATGGTGGCAA 
AGGCTTACCT 
TTACATCAGA 
TCTTTGTAAC 
TCTTTGATGA 
TCAATCGCTG 
GAGAGGTTTT 
GTTTCATGAA 
ATTACGATGC 
AGATGATGGC 
TGGAGTTGGA 
TGTCAAGTCC 
GCTACCTACT 
TTGATGGTCG 
AGACAGAGAT 
ATATCTTGAT 
AACGTAAGGG 
ACTATCCACT 
GACAACCAGC 
ACTTGTCTGA 
GGAACGTTGG 
ACCGCATCAT 
AACCTACACT 
TGGACGTTTG 
CATTATGGTT 
TGCGACACCA 



TGAAGGTGAT 
AATTGTGCGT 
GCTCTTGACC 
TCGCTACTAT 
TATTCTCATG 
GAGAGCGATT 
TGGTCCATGG 
AGGAAACTTT 
ACATGGTAAG 
GAAAGAACCG 
GGAACAAGGC 
TGGTTGCTCA 
CCTTCTGGAT 
AACACATTTT 
TGCTATTCCA 
TGTAGAAAGT 
TTATCGAACA 
AGATAGGGCC 
TGGGGAAGAT 
AGAAAATATG 
AATTCGGACA 
CCCACTAGAC 
CTTTTACGCT 
GTTTGGTAAG 
CCCAACTCTC 
TATCTTTGAA 
AAAACATATA 
ATCCACGGAC 
GTAGACGACG 
CCAGGTGTGA 
AATTGAGTAT
GCCAACGTCT 
GTGTACCACT 
TTACACGTTC 
AAGGTGTTAA 
TATTAAAATA 
TACTTCTCAC 
CAGGTTCCCC 
GTTTACTTAT 
CTTCTCGTAT 
AATTGATGCA 
CTTCGACGTT 
CGAACGCTTT 
TCTATCTCGT 
AGTAGTAGAC 
TATGCTTCCA 
TCACTACCTT 
AACAGGTCTT 
AAAAATTGGT 
TATTTTCCTT 
TACACCATCA 
AAGATTTTAA 
AACGTATGCA 
GTGATGGAAC 
CACCATTCTT 
TAGGTTCAAA 
TTGGGGATAC 
TGGCTATCGC 
ACATCTTCCG 



TTTGCCAGTT GAGAAAGCTG CAGCCAATAT TCTTGGTGGC AAATACGATA 
CTTTATCGTG GCTCGTAAAC CAGACCGCTT CCTTGGTTTG GTAGAAGCAG 
TGAAACCCTT AATGTTGGGA ATATGTCTCA AACACCAGAA ACTCGTTCTA 
TATCAACGTA GTAGACAAGG ATGTGGAAGA CTTCCACAAA CTGGCAGAAA 
ACTTACTGCT CAGATGGTTC CAAATGATCC AATTTCAGAC TTTTTGAGCT 
GGAAAAAAAT TTTTAGGAGG TCATTGTTAT GATACAATGG TGGCAAATTT 
TTTGTACTCA GCTTATCAAA TCTGTGATGA GTTGACGATC GTTTCATCTG 
TGTATTTGCT GGTTTCATTA CTGGTTTAAT CATGGGAGAT GTGACTACTG 
CGGTGGTAAC TTGCAACTGT TCGTTCTTGG GGTTGGTACC TTCGGTGGTG 
CGACGCAACT TCTGGTGCGG TTCTTGCGAC ACCTTCTCTG TTTCACAAGG 
CCGCTTGCCA TTACTACAAT CGCTGTACCA GTAGCAGCTC TCTTGACTTA 
CTTGGTCGTA TGACTACTAC CTTCTTCGCT CACCGTGTGG ATGCTGCAAT 
GACTATAAAG GTATTGAACG CAACTACTTG CTTGGTGCGA TTCCGTGGGC 
GCCCTTCCAG TCTTCTTTGC CCTTGCTTTT GGTGGTGCCT TTGTACAATC 
TTCGTTGAAG CCTACAAATG GGTTGCAGAT GGCTTGACAC TTGCAGGACG 
GGTCTTGGAT TTGCAATCTT GCTTCGTTAC CTTCCAGTTA AACGTAACCT 
GCTATGGGAT TTGGTTTGAC AGCTATGTTG ACTGTTCTTT ACTCATATGT 
GGTGGCGCTG TTGCTGGTAT CGTAGGTACT CTTCCTGCTG AAGTTGCTGA 
TTCGTGAACA ACTTCAAAGG TTTGTCTATG ATTGGTATTT CTATCGTAGG 
GCAGTGCTTC ACTTCAAAAA TAGCCAAAAA GTAGCTGTAG CAGCACCTTC 
GAAAGTGGGG AAATCGAAGA TGACGAATTC TAATTACAAA CTTACAAAAG 
TCAAATCAAC AAACGTAGCT TGTTTACTTT CCAATTAGGT TGGAACTACG 
AGCTTCTGGT TACCTTTACA TGATCTTGCC TCAGTTGCGT AAAATGTATG 
TCCTGAATTG AAAGAAATGA TGAAAGTTCA TACTCAATTC TTCAATACTT 
CCATACCATT ATCGCTGGTT TTGACCTTGC CATGGAAGAA AAAGATGGTG 
AGACGCCGTT AACGGTATCA AGACAGGTTT GATGGGACCA TTCGCTCCTC 
AATCTTTGGT TCACTTGTAC CTGCTATCAT GGGGTCAGTC GCAGCAACTA 
TGGCCAACCT TGGGGGATCT TCCTTTGGAT TGCAGTTGCA GTAGCGTATG 
TTGGAAACAG TTGGAATTTG CTTACAAAGA AGGGGTTAAC CTTATCAACA 

ACATGCAAAG TACCTTGACA GCTTTGATTG ACGCTGCATC TGTACTTGGT GTCTTCATGA 3780

TGGGTGCTCT TGTAGCAACA GTGATTAACT TTGAAATTTC TTACAAGTTG CCAATCGGTG 3840

AAAAGATGAT TGATTTCCAA GACATCTTGA ACCAAATCTT CCCACGTTTG CTTCCAGCAA 3900

TCTTTACTGC CTTTATCTTC TGGTTGCTTG GTAAGAAAGG TATGAACTCT ACTAAAGCTA 3960

TCGGTATTAT TATCGTACTT GCTTTGGCTC TTTCTGCCCT TGGTCACTTT GCACTTGGAA 4020

TGTAATTCCT TATGACTAAA TCATTAATTT TGGTGAGCCA TGGTCGCTTC TGTGAGGAGC 4080

TTAGAGGTAG CACAGAAATG ATTATGGGCC CACAAGACAA CATTTACACA GTAGCTCTTC 4140

TTCCAGAAGA TGGCCCAGAA GAATTTACTG CTAAATTTGA AGCTGTTATT GAAGGATTGG 4200

ATGATTTCCT AGTCTTTGCG GATCTTCTCG GTGGGACACC TTGTAATGTG GTGAGTCGCT 4260

TGATCATGGA AGGTCGTGAT ATTGACCTTT ACGCAGGGAT GAATCTTCCA ATGGTGATTG 4320

AATTTATCAA TGCGAGCCTT ACAGGCGCAG ATGCGGACTA CAAGAGCCGT GCTGCAGAAA 4380

GCATTGTGAA AGTTAATGAC CTGTTAGCGG GCTTCGATGA TGACGAAGAT GAATAATACT 4440

CTTCGAAAAT CTCTTCAAAC TACGTCAACG TCGCCTTGCC GTAGgTATAT GTTACTGACT 4500

TCGTCAGTCT TATCCGGCAA CCTCAAAACG GTGTTTTGAG CTGACTTCGT CAGTCTTATC 4560

CGGCAACCTC AAAGCAGTGC TTTGAGCAGC CTGCGGCTAG TTTCCTACAG ATTTTAGTTG 4620

GAACTCGATT CAATTCATGT GACAACGTGA AAATCGTTAG AGCATTTTAT ATAGAATATA 4680

CATGGGAATG TAGCTTACTC CCATTCCCAT ATTTAATAGA AAAAGAGGAA CTCAATGCTA 4740

CATTATACAA AAGAAGACTT GCTCGAATTG GGTGCAGAAA TCACTACGCG TGAAATCTAC 4800

CAACAGCCTG ATGTATGGAG AGAAGCTTTT GAATTTTATC AAGCAAAACG TGAAGAAATT 4860

GCAGCCTTCC TACAAGAAAT CGCTGATAAA CATGACTATA TTAAGGTTAT CTTGACAGGT 4920

GCTGGGACTT CTGCTTATGT GGGAGATACC TTGCTACCTT ATTTTAAGGA AGTCTATGAC 4980

GAACGCAAAT GGAATTTCAA TGCTATTGCG ACAACAGATA TCGTTGCCAA TCCAGCAACC 5040

TATTTGAAAA AAGATGTGGC AACTGTCCTT GTGTCTTTTG CTCGTAGTGG GAATTCGCCT 5100

GAAAGTTTGG CGACTGTTGA TTTGGCCAAA TCCTTGGTGG ATGAGCTTTA TCAAGTGACG 5160

ATTACTTGTG CAGCAGATGG TAAATTGGCT CTTCAAGCTC ACGGTGATGA TCGTAATCTC 5220

TTGCTCTTGC AACCAGCTGT CTCTAATGAT GCTGGATTTG CCATGACTTC TAGCTTTACG 5280

TCTATGATGT TGACAACTCT CTTGGTCTTT GATCCTACAG AATTTGCTGT TAAGTCTGAA 5340

CGTTTTGAAG TTGTATCTAG TCTTGCCCGT AAAGTTTTAG ACAAGGCAGA AGATGTCAAA 5400

GAGCTCGTTG ATTTAGACTT TAACCGTGTC ATCTATCTAG GCGCTGGTCC TTTCTTTGGA 5460

CTTGCTCATG AAGCTCAGCT CAAGATTTTG GAATTAACTG CTGGTCAAGT TGCGACCATG 5520

TATGAAAGCC CAGTTGGCTT CCGTCACGGT CCAAAATCTC TTATCAACGA CAATACAGTT 5580

GTTTTGGTCT TTGGTACAAC GACAGACTAC ACTCGTAAGT ACGACTTGGA CTTGGTTCGT 5640

GAAGTTGCTG GTGACCAGAT TGCTCGTCGT GTTGTGCTTT TGAGTGATCA AGCTTTTGGT 5700

CTTGAAAATG TCAAAGAAGT GGCCCTTGGT TGTGGCGGTG TCTTGAATGA TATTTACCGT 5760

GTCTTCCCTT ACATCGTTTA TGCCCAACTC TTTGCTTTAT TGACTTCACT CAAGGTAGAA 5820

AATAAACCAG ATACACCGTC TCCTACAGGT ACAGTAAACC GTGTAGTACA AGGTGTCATA 5880

ATTCACGAAT ATCAAAAGTA AGACAGTGTT TATGAATTCT TGACAAGAGG ATTTGTAAAT 5940

TATCAGATAA ACCATAGATT GTCAGTACGC TTTCTATGGT TTGTTTGCTT GAGAGAAATA 6000

GTAAAAGGAG AACAGAATGA AAGCATACAC AGAGCGTGTA TTTGGAAATG TTGAGGGTGA 6060

GGATGTCTTG GCCTATCGAT TTGAGACAGA CGGTGGCTAC CAACTTGAGG TTATGACTTA 6120

TGGTGCGACT ATCTTGCGCT ATGTCGCACC TGACAAGGCT GGAAATTTTG CCAATGTTAT 6180

CTTGGGATTT GATGACTTTG ATAGTTATGT AGGCAATAGT CCCAAGCATG GAGCAAGTGT 6240

AGGTCCTGTA GCGGGTCGTA TTGCAGGTGC GACCTTTGAG CTCAATGGTA AGACCTATGA 6300

CCTTGAGGTT AATAATGCTA GCAACTGTAA TCACAGTGGT TCAACTGGTT GGGATTCCAG 6360

CTTGTTTGAA GTTGAAGAAG TAAGCGATCA TGGCTTGACT CTCTACACAG AGCGTACAGA 6420

TGGGACAGGA GGGTTCCCTG GAAATCTCAA GATTTGGATC AGTTATCACT TGGAAGAAAC 6480

TGGTGCCTAT GAAATCAGCT ACAAGGTAAC GACCGATCAG GATACGCTGG TCAATCCAAC 6540

CAACCACAGC TATTTCAACT TGTCTGGTGA TTTCACGCAG ACGATTGACC GTCATGTCTT 6600

CCAACTAAAC ACAGAGGGCA TTTACTCAAT CGCTCCTGAC GGTGTTCCTG CCAAAACTCC 6660

AGAAGCCAAC CGTGATGTGG TCAAACACGT CTACAATGGT ACCTTGTTGA AGGATATCTT 6720

TGCAGAAGAA GATGAGCAAA TCCAGCTGGC ATCAGGTTTG GATCATCCAT TTGCCCTTCC 6780

TGCAGGCCAT GACAATGCTG GATTCCTTTA TGACCAAAAT TCAGGTCGCT TCCTGCTTTT 6840

CAAGACAGAA GCTCCTTGCT TTGTGGTCTA CACAGCAAAC TTTGTGGATG AAAGTGTCAT 6900

CATAGGAGGT CAGCCAATGC TACAGCACAA TGGGATTGCT CTTGAAGCGC AAGCTTTACC 6960

AGATGCCATT CACAGTGACC TTAAAGGCCA AGTCATTCTT AAAGCTGGTC AAACCTTCAC 7020

CAGTAAGACA CGTTATGAAC TTGTTGTGAA GTAAAAGAGT CATTGCGCCT ACTTTTGGGA 7080

GCTAGGAATA GGTACGCAGA GACAAATAGT AGGAAAATAT GATATAACTA AGCGTTGAAA 7140

GCTATCTGTT AATATAATAT TCAAACTACA ATAAGGAGTA AGAAAGAAAC GAAGAAAATT 7200

GTATTTGCTA GTGCCTTGGC TTTGACCTTG GCTGGAGCAG TTTTGACAAA TGATGTTTTT 7260

GCGAACGACA GACTTGTGGC AACACAAACT ACTGATGGTA AAAATGAAAA TGTATTGACC 7320

TCAGAGGTGC TAAAACCTTC TAGTGGCAAT GTTTTGGTTG GAATCAAAGG AGAATTTGTG 7380

GCTCCTCATC AACAATCTAT TTTGGATGCC ATCAATGCTA TCTGTAAAGA AGCGGCTGAC 7440

GAAGGTTTGG TAGATAAGTA TGTCCCTATC AAATGATCAA CTGACCTAGA AAAGGCAGCT 7500

TTTGCCAGAG CTACAGAAGC ATCTATAACC ATGGATCATA CCCGTCTTTC TAGCAAAGAT 7560

CTTTGGAGTG CCTTTCCAAC TTCTAATAGT ATAATGGGAG AAAATTTGGC ATGGAATCAT 7620

GACGGTTTTC TAAAAGCTAT TGAACAATGG CGTGCTGAAA AAGCAGATTA TGTGGAGAAA 7680

AAAATAGTGG TTCAGACAAC GGGAAATCTG GTCACTATGA GTCGCTAATT AACCCTAAAT 7740

TTACACACAT GGGGATGGCA GCTTTTAAAA ATCCTAACAA TCAATACAAA GCTATTACAA 7800

TTGCTCAAAC TCTAGGTGAT GATGCTTCTT CAGAGGAATT GGCTGGTAGA TATGGTTCTG 7860

CTGTTCAGTG TACAGAAGTG ACTGCCTCAA ACCTTTCAAC AGTTAAAACT AAAGCTACGG 7920

TTGTAGAAAA ACCACTGAAA GATTTTAGAG CGTCTACGTC TGATCAGTCT GGTTGGGTGG 7980

AATCTAATGG TAAATGGTAT TTCTATGAGT CTGGTGATGT GAAGACAGGT TGGGTGAAAA 8040

CAGATGGTAA ATGGTACTAT TTGAATGACT TAGGTGTCAT GCAGACTGGA TTTGTAAAAT 8100

TTTCTGGTAG CTGGTATTAC TTGAGCAATT CAGGTGCTAT GTTTACAGGC TGGGGAACAG 8160

ATGGTAGCAG ATGGTTCTAC TTTGACGGCT CAGGAGCTAT GAAGACAGGC TGGTACAAGG 8220

AAAATGGCAC TTGGTATTAC CTTGACGAAG CAGGTATCAT GAAGACAGGT TGGTTTAAAG 8280

TCGGACCACA CTGGTACTAT GCCTACGGTT CAGGAGCTTT GGCTGTGAGC ACAACAACAC 8340

CAGATGGTTA CCGTGTAAAT GGTAATGGTG AATGGGTAAA CTAGGCTCAG GCCATAGGTA 8400

AAGCATTCAT CTTACTTAGC AAAAAGAATG AACGATAAGA AAGAGGTTGA TGGCGAACAT 8460

TGGCCTCTTT TGATTTATAA AGATTGGATT CTTGTCGCCT CAATTTCAGA CTTTTCTATT 8520

GTAAGCTAAT ATTTTATAGC CCATTAAAAG CATAAGCGGT AATCTAATTT AAAAAATGCT 8580

GTAATTAGTC TGAAGTCCAC ACTTACTTGT TGAGATGTTA TCTCTGTTTT TTATCGTTA? 8640

AATTTACTGT ATTTTTTATA GTATGCAGAA TATTTTTAAG TATATTTCAA TAGAAATTTC 8700

TATCGATTTA TTGTATAATG ATAAGTAATT GTTGAAAAGT ACTCAGAAAA TTCCATACTA 8760

TATTATTTTT ATGTTTATAC TTTTATGCTA TAAAATATAG ATTGATATAA AGAATATAGA 8820

AAAAGCGAGG TTAATATGAG CCGAAAAAGC ATTGGTGAGA AACGCCATAG TTTCTCGATG 8880

AGAAAGTTGT CAGTGGGATT GGTATCAGTT ACTGTATCTA GTTTCTTTTT GATGAGTCAA 8940

GGGATTCAAT CGGTATCGGC CGATAATATG GAAAGTCCAA TTCATTATAA GTATATGACC 9000

GAGGGTAAAT TGACAGACGA GGAAAAATCC TTGCTGGTAG AGGCCCTTCC ACAACTGGCT 9060

GAAGAATCAG ATGATACTTA TTACTTGGTT TATAGATCTC AACAGTTTTT ACCGAATACA 9120

GGTTTTAACC CAACTGTTGG TACTTTCCTT TTTACTGCAG GATTGAGCTT GTTAGTTTTA 9180 

TTGGTTTCTA AAAGGGAAAA TGGAAAGAAA CGACTTGTTC ATTTTCTGCT GTTGACTAGC 9240 

ATGGGAGTTC AATTGTTGCC GGCCAGTGCT TTTGGGTTGA CCAGCCAGAT TTTATCTGCC 9300 

TATAATAGTC AGCTTTCTAT CGGAGTCGGG GAACATTTAC CAGAGCCTCT GAAAATCGAA 9360 

GGTTATCAAT ATATTGGTTA TATCAAAACT AAGAAACAGG ATAATACAGA GCTTTCAAGG 9420

ACAGTTGATG GGAAATACTC TGCTCAAAGA GATAGTCAAC CAAACTCTAC AAAAACATCA 9480

GATGTAGTTC ATTCAGCTGA TTTAGAATGG AACCAAGGAC AGGGGAAGGT TAGTTTACAA 9540 

GGTGAAGCAT CAGGGGATGA TGGACTTTCA GAAAAATCTT CTATAGCAGC AGACAATCTA 9600 

TCTTCTAATG ATTCATTCGC AAGTCAAGTT GAGCAGAATC CGGATCACAA AGGAGAATCT 9660 

GTAGTTCGAC CAACAGTGCC AGAACAAGGA AATCCTGTGT CTGCTACAAC GGTGCAGAGT 9720 

GCGGAAGAGG AAGTATTGGC GACGACAAAT GATCGACCAG AGTATAAACT TCCATTGGAA 9780 

ACCAAAGGCA CGCAAGAACC CGGTCATGAG GGTGAAGCCG CAGTCCGTGA AGACTTACCA 9840 

GTCTACACTA AGCCACTAGA AACCAAAGGT ACACAAGGAC CCGGACATGA AGGTGAAGCT 9900 

GCAGTTCGCG AGGAAGAACC AGCTTACACA GAACCGTTAG CAACGAAAGG CACGCAAGAG 9960 

CCAGGTCATG AGGGCAAAGC TACAGTCCGC GAAGAGACTC TAGAGTACAC GGAACCGGTA 10020 

GCGACAAAAG GCACACAAGA ACCCGAACAT GAGGGCGAAg cGGCAGTAGA AGAAGAACTT 10080 

CCGGCTTTAG AGGTCACTAC ACGAAATAGA ACGGAAATCC AGAATATTCC TTATACAACA 10140 

GAAGAAATTC AGGATCCAAC ACTTCTGAAA AATCGTCGTA AGATTGAACG ACAAGGGCAA 10200 

GCAGGGACAC GTACAATTCA ATATGAAGAC TACATCGTAA ATGGTAATGT CGTAGAAACT 10260 

AAAGAAGTGT CACGAACTGA AGTAGCTCCG GTCAACGAAG TCGTTAAAGT AGGAACACTT 10320 

GTGAAAGTTA AACCTACAGT AGAAATTACA AACTTAACAA AAGTTGAGAA CAAAAAATCT 10380 

ATAACTGTAA GTTATAACTT AATAGACACT ACCTCAGCAT ATGTTTCTGC AAAAACGCAA 10440 

GTTTTCCATG GAGACAAGCT AGTTAAAGAG GTGGATATAG AAAATCCTGC CAAAGAGCAA 10500 

GTAATATCAG GTTTAGATTA CTACACACCG TATACAGTTA AAACACACCT AACTTATAAT 10560 

TTGGGTGAAA ATAATGAGGA AAATACTGAA ACATCAACTC AAGATTTCCA ATTAGAGTAT 10620 

AAGAAAATAG AGATTAAAGA TATTGATTCA GTAGAATTAT ACGGTAAAGA AAATGATCGT 10680 

TATCGTAGAT ATTTAAGTCT AAGTGAAGCG CCGACTGATA CGGCTAAATA CTTTGTAAAA 10740 

GTGAAATCAG ATCGCTTCAA AGAAATGTAC CTACCTGTAA AATCTATTAC AGAAAATACG 10800 

GATGGAACGT ATAAAGTGAC GGTAGCCGTT GATCAACTTG TCGAAGAAGG TACAGACGGT 10860

TACAAAGATG ATTACACATT TACTGTAGCT AAATCTAAAG CAGAGCAACC AGGAGTTTAC 10920

ACATCCTTTA AACAGCTGGT AACAGCCATG CAAAGCAATC TGTCTGGTGT CTATACATTG 10980

GCTTCAGATA TGACCGCAGA TGAGGTGAGC TTAGGCGATA AGCAGACAAG TTATCTCACA 11040

GGTGCATTTA CAGGGAGCTT GATCGGTTCT GATGGAACAA AATCGTATGC CATTTATGAT 11100

TTGAAGAAAC CATTATTTGA TACATTAAAT GGTGCTACAG TTAGAGATTT GGATATTAAA 11160

ACTGTTTCTG CTGATAGTAA AGAAAATGTC GCAGCGCTGG CGAAGGCAGC GAATAGCGCG 11220

AATATTAATA ATGTTGCAGT AGAAGGAAAA ATCTCAGGTG CGAAATCTGT TGCGGGATTA 11280

GTAGCGAGCG CAACAAATAC AGTGATAGAA AACAGCTCGT TTACAGGGAA ACTTATCGCA 11340

AATCACCAGG ACAGTAATAA AAATGATACT GGAGGAATAG TAGGTAATAT AACAGGAAAT 11400

AGTTCGAGAG TTAATAAAGT TAGGGTAGAT GCCTTAATCT CTACTAATGC ACGCAATAAT 11460

AACCAAACAG CTGGAGGGAT AGTAGGTAGA TTAGAAAATG GTGCATTGAT ATCTAATTCG 11520

GTTGCTACTG GAGAAATACG AAATGGTCAA GGATATTCTA GAGTCGGAGG AATAGTAGGA 11580

TCTACGTGGC AAAACGGTCG AGTAAATAAT GTTGTGAGTA ACGTAGATGT TGGAGATGGT 11640

TATGTTATCA CCGGTGATCA ATACGCAGCA GCAGATGTGA AAAATGCAAG TACATCAGTT 11700

GATAATAGAA AAGCAGACAG ATTCGCTACA AAATTATCAA AAGACCAAAT AGACGCGAAA 11760

GTTGCTGATT ATGGAATCAC AGTAACTCTT GATGATACTG GGCAAGATTT AAAACGTAAT 11820

CTAAGAGAAG TTGATTATAC AAGACTAAAT AAAGCAGAAG CTGAAAGAAA AGTAGCTTAT 11880

AGCAACATAG AAAAACTGAT GCCATTCTAC AATAAAGACC TAGTAGTTCA CTATGGTAAC 11940

AAAGTAGCGA CAACAGATAA ACTTTACACT ACAGAATTGT TAGATGTTGT GCCGATGAAA 12000

GATGATGAAG TAGTAACGGA TATTAATAAT AAGAAAAATT CAATAAATAA AGTTATGTTA 12060

CATTTCAAAG ATAATACAGT AGAATACCTA GATGTAACAT TCAAAGAAAA CTTCATAAAC 12120

AGTCAAGTAA TCGAATACAA TGTTACAGGA AAAGAATATA TATTCACACC AGAAGCATTT 12180

GTTTCAGACT ATACAGCGAT AACGAATAAC GTACTAAGCG ACTTGCAAAA TGTAACACTT 12240

AACTCAGAAG CTACTAAAAA AGTACTAGGA GCAGCGAATG ATGCAGCCTT AGATAACCTA 12300

TACTTAGATA GACAATTTGA AGAAGTTAAA GCTAATATAG CAGAACACCT AAGAAAAGTA 12360

TTAGCGATGG ATAAATCAAT CAATACTACA GGAGACGGTG TAGTTGAATA CGTAAGTGAG 12420

AAAATCAAAA ATAACAAAGA AGCATTTATG CTAGGTCTTA CTTATATGAA CCGTTGGTAC 12480

GATATTAATT ATGGTAAAAT GAATACAAAA GATTTATCTA CGTACAAGTT TGACTTTAAC 12540

GGAAATAATG AGACTTCAAC GTTGGATACT ATTGTCGCAT TAGGAAATAG TGGACTAGAT 12600






WO 98/18931 



PCT/US97/19588 




AACCTGAGAG CTTCAAATAC TGTAGGTTTA TATGCGAATA AACTTGCATC GGTAAAAGGA 12660
GAAGATTCAG TCTTTGACTT CGTAGAAGCG TATAGAAAAC TGTTCTTACC AAACAAAACA 12720
AATAACGAGT GGTTTAAAGA AAATACAAAG GCATATATAG TCGAAATGAA GTCTGATATT 12780
GCAGAAGTAC GAGAAAAACA AGAATCACCA ACAGCCGATA GAAAATATTC ATTAGGAGTT 12840
TACGATAGAA TATCAGCACC AAGTTGGGGG CATAAGACTA TGTTATTACC ACTACTAACT 12900
1TACCTGAAG AATCTGTGTA TATTTCATCG AATATGTCTA CACTTGCATT CGGTTCGTAT 12960
GAAAGATATC GTGATAGTGT GGATGGAGTT ATTCTTTCAG GAGATGCTTT ACGAACTTAT 13020
GTAAGAAATA GAGTTGATAT AGCAGCGAAA AGGCATAGAG ACCATTATGA TATTTGGTAC 13080
AATCTTCTTG ACAGTGCTTC AAAAGAAAAA CTTTTCCGTT CTGTGATAGT TTATGATGGA 13140
TTCAATGTAA AAGATGAGAC AGGAAGAACT TATTGGGCAA GGTTAACGGA TAAAAACATC 13200
GGCTCTATTA AAGAATTCTT CGGACCTGTT GGGAAATGGT ATGAGTATAA TAGTAGTGCA 13260
GGAGCGTATG CGyAtGGAAG TTTAACGCAC TTTGTGTTAG ATAGATTATT AGATGCTTAT 13320
GGAACGTCGG TTTATACTCA TGAAATGGTT CATAATTCTG ATTCTGCAAT CTACTTTGAA 13380
GGAAATGGTA GACGTGAAGG ATTGGGAGCG GAGTTATACG CACTTGGTTT ACTGCAATCT 13440
GTAGATAGTG TAAATTCTCA TATTTTAGCT TTAAATACGT TATATAAAGC AGAAAAAGAT 13500
GATTTGAATA GATTGCATAC ATATAATCCG GTGGAACGTT TCGATTCGGA TGAGGCGCTT 13560
CAAAGTTATA TGCATGGATC ATATGATGTA ATGTATACAC TTGATGCGAT GGAAGCAAAA 13620
GCGATATTAG CTCAAAATAA TGATGTTAAG AAAAAATGGT TTAGAAAAAT AGAAAATTAT 13680
TACGTTCGTG ATACTAGACA TAATAAAGAT ACACATGCAG GAAATAAAGT CCGTCCATTA 13740
ACAGATGAAG AAGTAGCTAA CTTAACATCG TTAAACTCAT TAATCGACAA CGACATCATA 13800
AATAGACGTA GCTATGATGA TAGTAGAGAA TATAAACGAA ATGGCTACTA TACTATAAGT 13860
ATGTTCTCTC CTGTATACGC AGCGCTAAGC AATTCGAAAG GTGCTCCTGG AGATATTATG 13920
TTTAGAAAAA TAGCTTATGA ATTACTTGCG GAAAAAGGTT ATCACAAAGG ATTCCTACCT 13980
TATGTTTCTA ATCAGTACGG AGCAGAAGCA TTTGCCAGCG GAAGCAAAAC ATTCTCATCA 14040
TGGCATGGAA GAGATGTTGC TTTAGTGACA GATGATTTAG TATTTAAGAA AGTATTCAAT 14100
GGTGAGTACT CATCATGGGC TGATTTCAAA AAAGCAATGT TTAAACAACG TATAGATAAA 14160
CAAGATAATC TGAAACCAAT AACAATTCAA TACGAATTAG GTAATCCTAA TAGTACAAAA 14220
GAAGTAACTA TAACAACGGC TGCACAAATG CAACAATTAA TTAATGAAGC GGCTGCGAAA 14280
GATATTACTA ATATAGATCG TGCAACGAGT CATACCCCAG CAAGTTGGGT GCATTTATTA 14340
AAACAAAAAA TCTATAATGC ATATCTTCGC ACTACAGATG ACTTTAGAAA TTCTATATAT 14400
AAATAAGATT GTAGAGTTTC ATTGTTGAGT AGTGTTTCTT GTAAGGATGA GGAGTCAGAT 14460
GACAAATCGA CTCCTTTTTC TTATGGATCG ATGTAGAGAT TTGATTGAAT GCAGATTGCA 14520
GGAATCATCT TCAACTCATC AACGACCAAT GGTGACAAGG TGGATTTCAA TCCCACAGAA 14580
AATGTTGATT TGAGAAATAA CTTTGCTAGT CTAGTAAAAT AAATACAAAA CAATCCTAGA 14640
AGATTTTTTC TGGGATTGTT TTTTGCTGAG TGGGATGCTT CAAGTTGTCT GGCTTGACTT 14700
TCTTGAGGGA AGTTATATAA TAGTTGTAAT AATTAG 14736

(2) INFORMATION FOR SEQ ID NO: 172:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 11770 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 172:
ACAGGAAAGC ACGATAGCAA TCTCTTTGGA AGATTTAAAA AATATTCCTC AAAGTTTCGC 60
TGTTGCTTAC GGTGATACGA AAGTATCTTC GATTCTCTCT GTCTTGCGTG CTAATTTAGT 120
AAATCATTTG ATTACAGACA AAAATACAAT TTTAAAAGTT TTGGAAGAAG ATGGGGATTT 180
GACTTTTAGA GAGATTCTAG GTGAGTGAAA ATGATAGACT GATTCAGTTT ATCGTTTTTC 240
TTTTTAGTTG ATTGCACATT TGTGCTTATA TAAACAAAAA TAGTTTATCT GTTGTTTTTG 300
GATTGACAAC TTTATTATGT AGTTGTATTC TATAGTTACA AAAGAAAATT TTAAAATTTC 360
AAATGAAAAA AGCTTTTTAC ATAGTGAAAT GAGGAGGAAT TTATGGAAAT GATTGTTCCA 420
GATCAAATTA TCATGGGTTT AATTTTATAT GCTGGTGATG CGAAACAACA TATTTATAAA 480
GCGTTAGATT ACATAAAAAA TGGTACATGT GAACGGTGTG AAGAAGAAAT ACAGTTAGCT 540
GATGCAGCCT TATTAGAAGC TCATAATCTA CAAACAAAAT TTTTGGCACA GGAAGCGTCT 600
GGTACAAAGA CAGAAATTAC AGCTCTCTTT GTTCATTCAC AAGATCATCT CATGACCAGT 660
ATGACGGAGA TTAATTTAAT CAAAGAAATT ATTAGTTTGA GAAAAGAACT TCATAAAAAA 720
TAATACTAGA GTATTATCAT TGTTATTAAC ATAGAGGAGG AAAACATAAT GGTGAAGATT 780
GGTTTGTTTT GTGCAGCAGG TTTTTCTACT GGTATGCTTG TAAATAATAT GAAAATTGCA 840
GCGCAATCTA GTGGAGTTGA GGCAGAAATA GAGGCGTTTT CTCAGTCTAA ATTAGCGGAT 900
TATGCGCCAA ATATAGATGT TGCACTATTG GGTCCACAAG TTGCTTATAC ATTAGATAAA 960
TCAAAAGAAA TTTGTGATAA GTGTGATGTT CCGATAGCTG TTATTCCGAT GATGGACTAT 1020
GGTATGTTAG ATGGGAAAAA AGTATTAGAT TTGGCCCTAT CTTTGATTAG TGGGTAAGAA 1080

AAGGAGATTT ATTATGTCAA AGATGGATGT TCAGAAAATC ATTGCACCGA TGATGAAGTT 1140 

TGTGAATATG CGTGGCATTA TAGCTCTAAA AGATGGGATG TTAGCAATTT TGCCATTGAC 1200 

AGTAGTTGGT AGTTTGTTCT TGATTATGGG ACAATTGCCG TTCGAAGGAT TAAATAAGAG 1260 

CATTGCTAGT GTTTTTGGAG CTAATTGGAC AGAGCCGTTT ATGCAAGTAT ATTCAGGAAC 1320 

TTTTGCTATT ATGGGTCTAA TTTCTTGTTT TTCAATTGCC TATTCTTATG CTAAGAATAG 1380 

CGGAGTAGAG GCTTTACCAG CTGGAGTTCT ATCTGTATCT GCATTCTTTA TTTTGCTAAG 1440 

ATCATCTTAT ATCCCTAAAC AAGGTGAGGC GATTGGGGAC GCTATTAGTA AAGTTTGGTT 1500 

TGGAGGCCAA GGAATTATCG GTGCTATCAT TATAGGTTTG GTAGTAGGAA GTATTTATAC 1560 

CTTCTTTATA AAGAGAAAAA TTGTTATTAA GATGCCAGAA CAAGTTCCAC AAGCTATTGC 1620 

CAAACAGTTT GAAGCAATGA TTCCAGCATT TGTAATTTTC TTATCTTCTA TGATTGTATA 1680 

TATTTTAGCG AAGTCATTGA CTAATGGCGG AACATTCATA GAAATGATTT ATTCTGCTAT 1740 

TCAAGTTCCG TTGCAAGGTT TAACTGGATC TTTGTATGGT GCTATTGGAA TTGCATTCTT 1800 

TATATCATTT TTGTGGTGGT TTGGTGTTCA TGGGCAATCG GTAGTAAATG GAGTAGTGAC 1860

AGCTCTGCTT TTATCTAATC TTGATGCTAA TAAAGCTATG TTAGCCTCTG CTAATCTATC 1920 

ATTAGAAAAT GGTGCACATA TTGTTACTCA ACAATTTTTA GATTCATTTT TAATTCTATC 1980 

AGGTTCAGGG ATTACGTTTG GTCTTGTAGT TGCCATGCTT TTTGCAGCAA AATCAAAACA 2040 

ATACCAAGCC TTAGGAAAAG TTGCAGCTTT TCCAGCAATA TTTAACGTAA ATGAGCCAGT 2100 

TGTATTTGGA TTTCCGATTG TCATGAATCC AGTTATGTTT GTACCTTTCA TTCTTGTTCC 2160 

TGTACTTGCA GCTGTGATAG TATATGGAGC TATTGCAACA GGTTTCATGC AGCCATTCTC 2220 

AGGGGTAACA TTGCCTTGGA GTACACCAGC TATTTTATCA GGATTTTTGG TGGGTGGATG 2280 

GCAAGGAGTT ATTACTCAGC TGGTGATATT AGCGATGTCT ACATTGGTTT ATTTTCCATT 2340 

CTTTAAAGTA CAGGATCGTT TAGCTTACCA AAATGAAATC AAACAATCTT AGAGGTATTT 2400 

GTGTGTTACT GTTAAACTCA CACATTTGTG CTAAAAATTA GAGAGTTAAA ATTTTTCTAG 2460 

TTAAAAGCTT GAAAATTTCT ATAAAAATCG GTATTATATT TTCGAAAGAA ATAAAAATAT 2520 

TTTCGAAAGA AAGGTGCTTA CGATGGTAAA TACAGAAGTA GCAAGAACAA CAATCAAGAC 2580 

AGAATATTTT GGCAGCCTTA CTGAAAGGAT GAACAAATAT CGAGAAGATG TTTTAAATAA 2640 

AAAACCTTAT ATTGATGCTG AGAGAGCAGT TCTAGCAACA CGCGCCTATG AACGATACAA 2700 

GGAACAACCT AATGTCCTAA AACGTGCATA TATGCTGAAA GAAATTTTGG AAAATATGAC 2760 






TATCTATATT GAAGAAGAAT CTATGATTGC GGGAAATCAA GCTTCTTCCA ATAAAGATGC 2820

TCCTATTTTT CCGGAATATA CGCTAGAATT TGTTCTCAAT GAGTTGGATC TTTTTGAAAA 2880

GCGTGATGGA GATGTTTTCT ATATTACAGA AGAAACAAAA GAACAACTTA GAAGTATTGC 2940

TCCGTTTTGG GAAAATAATA ATTTACGTGC TAGAGCTGGT GCCTTATTAC CTGAAGAAGT 3000

GTCTGTTTAT ATGGAAACAG GATTCTTCGG TATGGAAGGT AAGATGAATT CTGGAGATGC 3060

TCACTTAGCA GTTAACTATC AGAAACTTTT GCAATTTGGT TTAAGAGGTT TTGAAGAGCG 3120

GGCTCGTAAA GCAAAAGTAG CTCTAGATTT AACAGATCCA GCAAGTATTG ATAAATATCA 3180

TTTTTACGAC TCTATATTTA TCGTAATCGA TGCTATTAAA GTATATGCAA AGCGCTTTGT 3240

TGCTCTTGCT AAAAGTTTAG CCGAAAATGC AAATCCTAAA CGTAAGAAAG AATTACTTGA 3300

GATTGCAGAT ATTTCCTCTA GAGTCCCATA TGAACCGGCA ACTACTTTTG CAGAAGCTAT 3360

TCAATCAGTT TGGTTTATTC AATGTATTTT ACAAATTGAA TCTAATGGCC ACTCTCTTTC 3420

ATATGGCCGT TTTGATCAAT ATATGTATCC ATATATGAAG GCTGATTTAG AAAGTGGTAA 3480

AGAAACAGAA GATAGCATTG TTGAACGTCT GACAAATCTT TGGATTAAGA CAATTACAAT 3540

TAATAAGGTT CGCAGTCAAT CACATACATT TTCTTCAGCA GGAAGTCCTT TATATCAAAA 3600

TGTTACAATT GGTGGACAGA CTCGAGATAA GAAGGATGCT GTTAACCCAT TATCTTATTT 3660

GGTATTAAAA TCAGTTGCAC AAACCCATCT ACCGCAACCT AATCTAACTG TACGTTACCA 3720

TGCAGGTTTA GATGCTCGTT TCATGAATGA GTGTATTGAA GTGATGAAAC TTGGTTTTGG 3780

TATGCCTGCA TTTAATAATG ATGAGATTAT TATTCCTTCT TTTATTGCAA AAGGAGTATT 3840

GGAAGATGAT GCTTATGATT ACAGTGCCAT TGGATGTGTT GAAACGGCAG TTCCAGGGAA 3900

ATGGGGCTAT CGTTGCACAG GTATGAGTTA TATGAACTTC CCTAAGGTTC TACTTATCAC 3960

GATGAATGAT GGAATTGATC CGGCTTCGGG TAAACGGTTT GCACCAAGCT TTGGTCGTTT 4020

TAAGGATATG AAGAACTTTT CTGAATTAGA AAATGCTTGG GATAAAACAC TAAGATATTT 4080

GACACGAATG AGTGTTATTG TTGAAAATTC TATTGATTTA TCATTGGAAC GAGAAGTTCC 4140

TGATATTCTA TGTTCAGCAT TGACTGATGA TTGTATTGGT CGTGGAAAAC ACCTTAAAGA 4200

AGGTGGAGCA GTATATGATT ATATATCAGG ATTGCAAGTT GGAATTGCAA ATTTGTCGGA 4260

TTCATTAGCT GCAATTAAAA AATTGGTGTT TGAGGAAGAA CGTATAAGCC CAAGTCAGCT 4320

TTGGCATGCA CTGGAAACAG ATTATGCCGG AGAAGAAGGT AAGGTCATTC AAGAAATGTT 4380

GATTCATGAT GCACCTAAGT ATGGTAATGA TGATGATTAT GCTGACAAAT TGGTTACTGC 4440

TGCTTATGAC ATTTATGTTG ATGAAATTGC TAAATATCCT AATACACGTT ATGGAAGAGG 4500

GCCTATTGGA GGAATTCGTT ATTCAGGAAC ATCTTCTATC TCAGCCAACG TAGGGCAGGG 4560

ACGTGGAACA TTAGCAACTC CAGATGGACG CAACGCGGGT ACACCGTTAG CAGAGGGTTG 4620

TTCACCATCA CATAATATGG ATCAACACGG CCCTACATCT GTTTTAAAAT CTGTTTCAAA 4680 

ATTACCAACA GATGAAATCG TAGGTGGGGT TCTCTTAAAT CAGAAAGTAA ATCCTCAAAC 4740 

GTTAGCCAAA GAAGAAGATA AATTAAAACT AATTGCTTTG TTACGAACAT TCTTTAATCG 4800 

TTTACATGGG TACCATATTC AATACAATGT TGTTTCCAGA GAGACGCTGA TTGACGCTCA 4860 

GAAACATCCT GAAAAACACA GAGACTTAAT TGTTCGTGTT GCAGGATACT CTGCATTCTT 4920

CAATGTTCTT TCTAAGGCAA CCCAAGATGA CATTATAGGA CGTACTGAGC ATACTTTGTA 4980 

AAATAAAGAG GTTCTTTTTA TGGAATTTAT GCTTGACACA TTAAATTTAG ATGAGATTAA 5040 

AAAGTGGTCT GAAATTTTGC CGCTAGCTGG GGTAACTTCA AATCCCACTA TTGCAAAAAG 5100 

AGAGGGTTCT ATTAATTTTT TTGAACGAAT CAAAGATGTA AGAGAATTGA TTGGCTCTAC 5160 

ACCCTCTATT CATGTTCAGG TGATTTCTCA AGATTTTGAA GGCATCTTAA AGGATGCTCA 5220 

TAAAATTCGA AGACAAGCAG GAGATGATAT ATTTATCAAA GTACCTGTTA CTCCAGCTGG 5280 

ATTACGTGCA ATAAAGGCGC TAAAAAAAGA GGGCTACCAT ATCACTGCAA CAGCTATTTA 5340 

TACAGTTATT CAGGGATTAT TAGCTATCGA AGCAGGAGCG GATTACCTAG CTCCATATTA 5400 

TAATAGAATG GAAAATCTGA ACATTGATTC AAATTCTGTC ATTCGTCAAT TAGCTCTTGC 5460 

TATTGATAGA CAGAACTCTC CTAGTAAGAT TTTAGCTGCA TCCTTTAAAA ATGTAGCACA 5520 

AGTAAATAAT GCTTTAGCTG CAGGTGCGCA TGCTGTTACA GCAGGAGCGG ATGTTTTTGA 5580 

ATCAGCTTTC GCCATGCCAT CTATCCAAAA GGCGGTTGAT GATTTTTCTG ACGATTGGTT 5640 

TGTTATTCAA AATAGTCGTT CCATTTAGAT AGAGAGGAAA TACATATGAG AATTTTTGCT 5700 

AGTCCTTCTA GATATATTCA GGGGGAAAAT GCCTTGTTTG AAAATGCCAA ATCAATTTTG 5760 

GATTTGGGAA ATTGCCCTAT TCTATTATGC GATCAGTTGG TTTATGATAT TGTTGGAAAA 5820 

CGATTTGAAG ATTACCTACA TAGGTATGGT TTCCATATTG TTCTGGCGCT ATTTAATGGT 5880 

GAAGCTTCTG ACAATGAAAT CAATCGAGTT GTTGCCTTGG CTGAGAAAGA AAATTGTGAT 5940 

AGTATTATCG GTCTTGGTGG GGGAAAGACG ATTGATAGCG CAAAAGCTAT TGCAGATTTG 6000 

ATTGAAAAGC CTGTTATTAT TGCTCCAACA ATTGCATCGA CCGACGCACC TGTATCTGCT 6060 

TTATCTGTTA TTTATACAGA TGAAGGTGCA TTTGATCATT ATCTATTTTA TTCTAAAAAT 6120

CCAGATTTAG TTTTGGTTGA TACAAAAGTT ATTTCACAAG CCCCTAAGCG TTTATTAGCG 6180 

TCTGGTATTG CAGATGGTTT AGCAACTTGG GTTGAGGCGC GTGCGGTTAT GCAGGCAAAT 6240 

GGAAAAACTA TGTTGGGACA ACAGCAAACA TTGGCTGGAG TTGCAATTGC GAAGAAATGT 6300 

GAAGAAACGC TGTTTGCAGA TGGTTTACAG GCTATGGCAG CTTGTGAAGC TAAAGTGGTG 6360

ACACCAGCAT TAGAAAATAT TGTTGAAGCT AATACTTTAT TGAGTGGTCT AGGTTTTGAA 6420 

AGTGGAGGAT TAGCTGCGGC GCATGCAATT CATAATGGTT TTACTGCATT GACAGGTGAC 6480 

ATTCATCATT TAACACATGG TGAAAAAGTA GCTTATGGAA CTTTAGTACA ACTATTATTG 6540 

GAAAATAGAC CTAAAGAAGA ACTTGATAAG TATATTGAGT TTTACAAAAA AATTGGTATG 6600 

CCAACAACTC TAAAAGAAAT GCATTTGGAT CAAGTTGGAT ATGATGATTT AATAAAAGTT 6660 

GGTAAACAAG CAACTATGGA GGGTGAGACA ATTCATCAGA TGCCGTTTAA GATTTCGCCT 6720 

TCAGATGTTG CTCAAGCTAT TATCGCTGTA GATGCCTATG TAAATTCAAA ATAAACAATA 6780 

AGGACTACTG TTTTCCAAAT GGTAGTCTTT TATTGATCCC TGTATTGAAT TCTATAGAAG 6840 

ATTGAAATAG GATGAGAACA AATCGATTGG GAAAGTAAAA TTAATTTCTA TAAATGTTTT 6900 

AGCAATTGTT TCGTACTATT TCAGATTCAG TCTACTATAT GTTCTTCATA AATCAAAAAG 6960 

CGACATAGGT TGTCGGCTAT TTATTGTGAA TACATTAATT AGCATTCCAG TTTTATCTTC 7020 

GGTCTAAAAT AAGTATTTTG TGCTATACGA GATAAGCTTC TTGACTTACT CCTTGATTTA 7080

CTGCATAACA ATGGGATAAA AAGTGGGAGA TAGAGCAATT CATAGTCATC AAAATTAATG 7140 

AGATACAGTA TACAGTTTTT CCTTTAAACA CATTTCAAAT TCCCTCAAAA ATGGTATAAT 7200 

AGTAACATCA CAAAATTGGA GAGAGACCAT GAGTTTTTAC AATCATAAAG AAATTGAGCC 7260 

TAAGTGGCAG GGCTACTGGG CAGAACATCA TACATTTAAG ACAGGAACAG ATACATCAAA 7320 

ACCTAAGTTT TATGCGCTTG ATATGTTCCC TTATCCGTCT GGAGCTGGTC TGCACGTAGG 7380 

ACACCCAGAA GGTTATACTG CAACCGATAT CCTCAGTCGT TACAAACGTG CGCAAGGCTA 7440 

CAATGTCCTT CACCCAATGG GTTGGGATGC TTTTGGTTTG CCTGCAGAGC AATACGCTAT 7500 

GGATACTGGT AATGACCCAG CAGAATTTAC AGCGGAAAAC ATTGCCAACT TCAAACGTCA 7560 

AATTAATGCG CTTGGATTTT CTTATGACTG GGATCGTGAA GTCAACACAA CAGATCCAAA 7620

CTACTACAAG TGGACTCAAT GGATTTTCAC CAAGCTTTAC GAAAAAGGCT TGGCCTATGA 7680 

AGCTGAAGTG CCAGTAAACT GGGTTGAGGA ATTGGGAACT GCCATTGCCA ATGAAGAAGT 7740 

GCTTCCTGAC GGAACTTCTG AGCGTGGAGG CTATCCAGTT GTCCGCAAAC CAATGCGCCA 7800 

ATGGATGCTC AAAATCACGG CTTACGCAGA GCGCTTGCTC AATGACTTAG ATGAACTAGA 7860 

TTGGTCAGAG TCTATCAAGG ATATGCAACG CAACTGGATT GGTAAATCAA CTGGTGCCAA 7920 

TGTAACTTTC AAAGTAAAAG GAACAGACAA GGAATTTACA GTCTTTACTA CTCGTCCGGA 7980 

CACACTTTTC GGTGCGACTT TCACTGTCTT GGCTCCTGAA CATGAATTAG TAGACGCTAT 8040 

CACAAGTTCA GAGCAAGCAG AAGCTGTAGC AGACTATAAA CACCAAGCCA GCCTTAAGTC 8100 






TGACTTGGCT CGTACAGACC TTGCTAAAGA AAAAACAGGG GTTTGGACTG GTGCTTATGC 8160

CATCAACCCT GTCAATGGTA AGGAAATGCC AATCTGGATT GCAGACTATG TCCTTGCTAG 8220

TTATGGAACA GGTGCGGTTA TGGCTGTGCC TGCCCACGAC CAACGTGACT GGGAATTTGC 8280

CAAACAATTT GACCTTCCAA TCGTCGAAGT ACTTGAAGGT GGAAATGTCG AAGAAGCTGC 8340

CTACACAGAG GATGGCCTGC ATGTCAATTC AGACTTCCTA GATGGATTGA ACAAAGAAGA 8400

CGCTATTGCC AAGATTGTGG CTTGGTTGGA AGAAAAAGGC TGTGGTCAGG AGAAGGTTAC 8460

CTACCGTCTC CGCGACTGGC TCTTTAGCCG TCAACGTTAC TGGGGTGAGC CAATTCCAAT 8520

CATTCATTGG GAAGATGGAA CTTCAACAGC TGTTCCTGAA ACTGAATTGC CGCTTGTCTT 8580

GCCTGTAACC AAGGATATCC GTCCTTCAGG TACTGGTGAA AGTCCACTAG CTAACTTGAC 8640

AGATTGGCTT GAAGTGACTC GTGAAGATGG TGTCAAAGGT CGTCGTGAAA CCAACACTAT 8700

GCCACAATGG GCTGGTTCAA GCTGGTACTA CCTCCGCTAT ATTGACCCGC ACAATACTGA 8760

GAAATTGGCT GATGAGGACC TCCTCAAACA ATGGTTGCCA GTAGATATCT ACGTGGGTGG 8820

TGCGGAACAT GCTGTACTTC ACTTGCTTTA TGCTCGTTTC TGGCATAAAT TCCTCTATGA 8880

CCTCGGTGTT GTTCCGACTA AGGAACCATT CCAAAAACTC TTTAACCAAG GGATGATTTT 8940

GGGAACAAGC TACCGTGACC ACCGTGGTGC TCTTGTGGCA ACCGACAAGG TTGAAAAACG 9000

TGATGGTTCC TTCTTCCATG TAGAAACAGG GGAAGAGTTG GAGCAAGCGC CAGCCAAGAT 9060

GTCTAAATCG CTCAAGAACG TTGTTAACCC AGACGATGTG GTGGAACAAT ACGGTGCCGA 9120

TACCCTTCGT GTTTATGAAA TGTTTATGGG ACCACTCGAT GCTTCGATTG CTTGGTCAGA 9180

AGAAGGTTTG GAAGGAAGCC GTAAGTTCCT TGACCGAGTT TACCGTTTGA TTACAAGTAA 9240

AGAAATCCTT GCGGAAAACA ATGGTGCTCT TGACAAGGTT TACAACGAAA CAGTCAAAGC 9300

TGTTACTGAG CAAATTGAGT CTCTCAAATT CAACACAGCT ATTGCCCAAC TTATGGTCTT 9360

TGTCAATGCT GCTAACAAGG AAGATAAGCT TTATGTTGAC TATGCCAAAG GCTTTATTCA 9420

ATTGATTGCA CCATTTGCAC CTCACTTGGC AGAAGAACTC TGGCAAACAG TCGCAGAAAC 9480

AGGtGAGTCA ATCTCTTATG TAGCTTGGCC AACTTGGGAC GAAAGCAAAT TGGTTGAAGA 9540

TGAAATTGAA ATTGTCGTCC AAATCAAAGG AAAAGTTCGT GCCAAACTCA TGGTTGCTAA 9600

AGATCTATCA CGTGAAGAAT TACAAGAAAT CGCTTTAGCT GATGAAAAAG TCAAAGCAGA 9660

AATTGACGGT AAGGAAATCG TGAAAGTAAT TGCGGTACCG AATAAACTCG TTAATATCGT 9720

CGTTAAATAA CGAGTTTATT AGCTCTATCT GCCACCTTCA ATAGTCCACT GGACTATTGA 9780

AsCCAACTAA ATTAGTTAAC ATTGTTGTGA AATAAGATAG GAGTCCTTCA GAGTAGAATC 9840

TGGAGGATTT TTTGAATCTT CTTATGAAAG TATGATATAC TATGGGCAAC TATAAAGTTT 9900 

GAAAAGTGAA ATAAGGAGAA TAAGATGCCA GTAAATGAAT ATGGTCAAAT GATTGGGGAG 9960 

TCAATGGAAG CTTATACTCC AGGTGAATTG CCTTCTTTTG ATTTCTTAGA AGGGCGTTAT 10020 

GCTAGGATAG AGGCTCTTTC AGTGGAAAAG CATGCGGAGG ATTTATTAGC TGTTTATGGC 10080 

CCTGATACGC CTCGGGAGAT GTGGACCTAC CTCTTTCAGG AGTCAGTAGC AGACATGGAG 10140 

GAACTGGTCA GCCTTTTAAA TCAGATGTTG GCTCGTAAGG ACCGTTTTTA TTATGCAATC 10200 

ATAGACAAGG CAACTGGTAA GGCTTTGGGA ACTTTTTCCC TCATGCGAAT TGATCAGAAT 10260 

AACCGAGTAA TAGAAGTGGG AGCTGTCACT TTTTCTCCAG AGCTCAGGGG GACACGGATA 10320 

GGAACAGAAG CCCAGTATCT CTTGGCTTGC TATGTCTTTG AGGAGCTTAA CTATCGTCGC 10380 

TATGAGTGGA AATGCGATGC TCTTAACCTG CCATCCAGAC GAGCAGCGGA ACGTTTGGGA 10440 

TTTATTTATG AAGGAACCTT CCGTCAGGCA GTGGTTTATA AGGGGCGTAC AAGAGATACG 10500 

GATTGGTTGT CTATGATTGA TAAGGACTGG CCTCAAGTCA AAGCTCGATT GGAAATATGG 10560 

TTGCGTCCTG AAAACTTTGA TAAAAATGGA CGACAGCACA AGAGCTTGAG AGAACTTTAA 10620 

GAGGTGTTGA GATGATTACT ATTAAAAAGC AAGAAATTGT CAAGCTAGAG GATGTTTTGC 10680 

ATCTCTATCA GGCTGTCGGT TGGACAAACT ATACCCATCA AACAGAGATG CTGGAGCAGG 10740 

CCTTATCTCA TTCATTAGTA ATTTATCTGG CACTTGATGG TGATGCTGTG GTGGGCTTGA 10800 

TTCGTTTGGT TGGAGATGGT TTTTCATCAG TTTTTGTACA GGATTTGATT GTTTTGCCTA 10860 

GCTATCAGCG TCAAGGGATT GGTAGCTCCT TGATGAAAGA GGCTTTAGGA AATTTTAAAG 10920 

AGGCCTATCA AGTCCAGCTG GCGACAGAAG AGACAGAAAA AAACGTGGGA TTTTATCGTT 10980 

CTATGGGCTT TGAAATCTTA TCCACCTATG ACTGTACAGG AATGATTTGG ATAAACAGAG 11040 

AAAAATAAAA AAACTTGTTT GTTCTTAAGC AAAGTTTAAG GATGGTCTAG TATCATATAG 11100 

TCATTAAATA AAGACCTCCT AACTTTATTT AATAAAATCC TAAACTTTTT TCATCACAAT 11160 

CTCCTAATGA AGCCACCCAA TCAGGTGGCT TTTTTGCGGT ACGACGGGCA TGTCGTATAT 11220 

CTGAGGTGTA AGTCCTCAGC CTGACTATCG TGAGGTAGCA GGGAGAGGAA GGGATAGCGA 11280 

AATCGTGGCT CTACGAACAG GAACGTGATA GTAAGGCGTA TATAGCGGAT AAGGAGGCTT 11340 

CAAACTCTAA AGTCCAAAAA GGTAGTCGTA ACCTATATGT GTAAATCACG AGAGTAATTG 11400 

AATTCGGACT AAGGTTTGTG TGAAAAAGAT AAATCTTTCT AGAGTCTAAA GACTCTGCGT 11460 

CAGATTTCCT ATTTTCACTG TAACCTTTTA ACGTCCTCAT ATCTTGTATA AACGAGGAAA 11520 

GATGTACGAC TTATCCCGTG AGGTTTCATG AGCGCTGAAA GCGTAGTAAC AACGAATCAT 11580 

GAGAAGTCAG CCGAGCCCAT AGTAGTGAGG AAACTTCCGT AATGGAAGTG GAGCGAAGGG 11640 

GTGAATACTC AAACAGTCTG GGGAGAGACT GTTTGAGGTC TGTCGCTAGA AAGAGAAAAC 11700

GACAGATCGA AGTAATCCTA CTTCACTTGT GTCTGTAAAA TGAGTGGTCT GATAGAACTG 11760 

GACTTTGAGG 11770 
(2) INFORMATION FOR SEQ ID NO: 173: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 4185 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 173: 

CGCGAAACTA CTTTCTTAGT ATAACACTTT CAGAATCATT GTCAATAGAA ATGACTTGAT 60 

TTTTTCAATT TTTTCAAGCT ATTTCCAAGG GTTGTAAAAT CGTCCCTGAT TCTGCAAGAT 120 

AAGTAGTAAA CTAACTACTA AAAACAAGGT TGCCAAGAGC AAGGTAATAT AGTCTCCTTT 180 

TTTCAAGGCC TGATAACTAT ACCATGTGCG TTTTTTCTCT TTCCCAAAGC GGCGAACTCC 240

ATGGCAGTCG CAATGGTATC AATGCGTTCT AGCGAGCTAA AAATCAAGGG CGTAATAATG 300 

AGCAGATTGC CTTTGATTCG TTGCATAAGA GAAGCTTTCT TGGATAATTC CATCCCACGC 360 

GCCTCCTGAG ACATCTTGAT AGTAAAGAAT TCTTCCTGCA AATCTGGAAT ATAGCGCAAG 420

GTCAGGCTGA CAGAATAAGC AATCTTATAG GGCACACCAA TTTGATTTAA ACTGGAAGCA 480 

AACTGACTAG GATGGGTTGT CATCAAAAAG ATAATAGCCA GAGGAATGGT GCAAAGATAC 540 

TTAATGGCCA AATTTAGCAG ATAAAAGAGC TCCTGGCTGG TTAGAGTGTA GACACCGATT 600 

CCCTGCCAAA TCACACTTCT CTCTCCATAA AGTCCAACCC CATACTCGGG AGAAAAGAGA 660 

TAGACCATCA AAACGTTTAA AACGGCAAAT ATCGTCGCAA AAACGGCTAC AAAGGAAACA 720 

TCTTTAAAGC GAATTTCTGA TAAATAGAGG AGAAAGACTG AAAAGATGGC AATCAGCAAG 780 

AGCATTCTGG TATCATAGCT AATCATGGCC GCCAATGATA CCAGAATGAA AAAGAGAAGT 840 

TTCCCAGCTC CTGACAAGCG ATGAATCACA GTATCTCTAT GCTGGTAACC GATTAATTTA 900 

GCTTGCATCC CTCTCTCCTT TCTTTGTAAA ATGCCGTTAA ATCCAGTGGA TCCACATCTA 960 

GTTTCTTAGC CAAGTTAAAG ATGGAGGTTT CTTTTAGATT GGCTTTTACT AACAGCTCAG 1020 

GATCGCTCAA CAGACTGGCT GGAACAGTAT CGGCAATCAA TTCTCCATCC ACCATGACAA 1080 

GGACCCGGTC TGAATAATCC AGCATCAATT GCATATCATG GGTAATCATG ACAATGGTAT 1140 

GCCCTTTTTG ATGTAACTCT TCGAGAAATT CCATAATCTC AGTATAGTTC TTCTGATCTT 1200 

GACCTGCAGT CGGTTCATCT AGGAGAATAA TTTCAGCTCC TAAGACCAAA ATTGAAGCAA 1260

TGGTGACACG TTTTTTCTGA CCAAATGACA GGGCAGAAAT AGGCCAATTA CGGAATTCAT 1320

AAAGTCCACA GATTTTCAAG GTTTCATATA CTCTCGTTTC AATTTCCTTC TCATCCACAC 1380

CTCGCAAACG GAGCCCTAGA GCCACCTCAT CAAAAATCAT ATTGGTTGAA ATCATTTGAT 1440

TAGGATTTTG TAGCACATAT CCTACTCGTT CCGCCCGCTC TGCAACAGAA TCGCCTTTTA 1500

TATCCTGTTT TTCCCAAAGA TAGCGTCCTT CCGTCTGAAT AAAGCTACTT ATAGCCTTGG 1560

CTAGAGTTGA TTTCCCTGCT CCATTTTTTC CGACAATAGC AATCTTTTCA CCCTTTTTAA 1620

TATCTAAATG TAGGGATTTT AAAATCGGTC TATCATCATA AGAAAAAGAT ACTTCCTCTA 1680

GTCTAAAGAG TGACTGCAAT GCTGGGGTTT CTTTTGCCAG TTCATTCTGC AACTGAACCT 1740

GACCTTTTGA GATAGACAAG TTATCCAGAT TCGCTAATTG TTCTTCCTTG ACTAAGTCCA 1800

CACCTAATTG ACGGAGAGTC GTTAGATAAA GGGGTTCTCG AATTCCATTT TGAGTCAATA 1860

AATCAGTCGC AAGCAACTGG TCAGGGCTCC CATTAAAAAG GATACGACCA TCGTTTATCA 1920

AGACAATCCG ATCCACAGGG CGATGCAGAA CGTCCTCCAA ACGGTGCTCG ATAATAAGAG 1980

TCGTCGTCCC CTCTTCCTTA TGAATCTGGT CAATCAATTC GATAATATCC TGACCTGACT 2040

TGGGATCTAG ATTGGCGAGT GGCTCATCAA ACAAGAGAAT CGGACTTTCA TCAATCAAGA 2100

CACCAGCCAG ACTGACTCGC TGCTTTTGTC CACCTGACAA ATCCTGAGGA CGCTGATCCA 2160

GTAAAGGAAG AAGGTCCAGC TTTTCAGCCC ATTTATAAAC ACGACCTTTC ATCTCATCTA 2220

GGGCTGTCAC ATCATTTTCC AGAGCAAACG CCAAATCTTC TGCCACAGAC AAGCCAATAA 2280

ACTGCCCATC TGTATCCTGC AAAACTGTGC TAACCAGATG AGACTTATCA TAGATGCTCA 2340

TATCAAAGGC TACTTGACCC TTTATCAAAA ATTCTCCATA TGTCTGACCC TTGTAAATAT 2400

TGGGAATAAT CCCATTCAAA CACTGACCCA AGGTAGATTT ACCTGACCCA GATGGTCCAA 2460

CAATTAAGAC TTTCTCTCCC TTGTAAATGG TCAAGTCTAT CCCTTGCAAG GTCGGTTCTT 2520

GTTGTGTTTC ATACCGGAAA GAGAAATCCT TCCACTCAAT TaTAGCTTCT TTCATCTTAC 2580

TCTCTTCATT CGCTTCTTAG ACTTCTATTT TATCATAAAT CAAGCCCTTC TTGCAGTCTC 2640


TCCTCTTAAA ATCTTAGCGC 


CAAAAAGATT 


CCTATCCTAG 


CTTACTTGCC 


TAACTAATCT 


2700 


ATAAACATCG AAAAAGACTA 


GTTGCCCAGC 


CTTCCCCATC 


ATTTTATACT 


CTTCGAAAAT 


2760 


CTCTTCAAAC CACGTCAGcT 


TCGCCTTGCC 


GTAGGTATGG 


TTACTGACTt 


CGTCAGTTTC 


2820 


ATCTACAACC TCAAAACCAT 


GTTTTGAGCc 


TGCTTCGTCA 


GTTCTATCCA 


CAATCTCAAA 


2880 


ACACTGTTTT GAGCAACtGC 


GGCTAGCTTC 


CTAGTTTGCT 


CTTTGATTTT 


CATTGAGTAT 


2940 


TAGTCCTTTT TCAAACTTCC 


TGCACGAGTT 


TGGGTTCCTG 


CATAGGCAAG 


TAAGAGAAGA 


3000 
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GTTCCTGCAA TAGCTACAGA TACACCATTG GCAATTCCCG CAACAATCCC TTGTGCAAAT 3060 

ACTTTTTCTG CCGCTTCTTG ATAAATCACA ACATCTCCAA GTGGTGCCAA GACACCCCAA 3120 

ACAAGGGCAT TTGCAAGTAG TTGAATGAGA TTAAAAATAA GAATATCTTT CCAGTCAAAA 3180 

ACACCATTGA TCACGCGAAC GTACTTTCTA AAAAGTCCCA CAACTAAACC AAAGAGTCCG 3240 

CTAGCGATAA TCCAAGTCCA CCATAGACCA TAACCAACAA GAGAGTCCTT GATTGCATGA 3300 

£CAATCAACC CGACAAGCAA ACCGATAATC GGTCCAAAAA TAATAGAAAG TAGCGCTTGT 3360 

ACCGCATACT GAAGCTGGAT GCTTGTATTT GGAACAGGGG TTGGAATGTT GATCATCCCG 3420 

ATGACGACAA AGAGGGCAGC GCCAATTCCG ACAGCAACAA CTTGTTTAAT TGTAAATTTG 3480 

ATTTCCATAC TATTCTCCTA TTTTATCCTT CTATTTTCTT TATTTCAATG GTCCAAGATG 3540 

AACCGACACC TACATTATAG GCCTTGGCAA AGGAACCTTG GTTGATAGCC AAACCTAAAC 3600 

GATAGAGAGA GTTGATGTAA AGGATGGGTT GCCCAATTCT CACATCTGCA AATGATTTGC 3660 

CATAGACAAC CTGATTTTGA TAGACCAGCA TATCAGCATG ATAGATGGTC ACTTCAAAAC 3720 

GATCACCAAA TTCTGGTTCC AGCTTGTAAA ATTCTTCCCG TGTGATAGAG GTCCAAAGCG 3780 

AACCGAAACG CACATCCAGA ATATCAATGG CTCCCTTCAC CAGATGATCT TCTATGATGG 3840 

TCGCTACGAC TGGAAGCTCT ACAATCTGTT CCACACTGAG CTCTGGCCCT ACTTCCTCAA 3900 

AAGTAATGTG ACCACTGGCC AGTTTAGCAC CAGTATAGGC ATAGACATCA CGACCGTGGA 3960 

AGGTATAAGA ATGCTCTGTG TTTTGACGCC TATTGGCCAC CTCAGAAATC TCACGAATGG 4020 

CTACAATGCC AACGTGTTTC TTGATAAAGG AAAGCGTCCC ATTATCTGGC GTGACAATGT 4080 

ATTGATTTTT TGCAGTCTTG GCAACTACAC TCTTACGTTT CGAACCGACA CCTGGATCGA 4140 

CAACCGATAC AAACGTCGTT CCCTCAGGCC AGTAATCCAC CGTCT 4185 
(2) INFORMATION FOR SEQ ID NO: 174:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 2069 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 174: 

TGATAGAGTT AAAGCCGCTG AGTCATTCAA TCCATCTCCA ACCATCAAAA TAGTGTGACC 60 

TGCTTTCTGC AGTTTCTCTA CTAACTCAAA TTTCCCATCA GGTTTCAAGT CTGTATAGAC 120 

CTGATCAAAG GGCAAATCTT TGACTAATTC CTCTGTCCTA ATCAAGGTGT CTCCTGTTGC 180 



WO 98/18931 



PCT/US97/19588



1104 

CAGAATCAAT TTTTyCCCCT GTGCCTTAAG TTTATCCAAG GCTGTTTTTG CTTCTTTTCT 240 

CAAAGGAGTA TGAATGCAGA ACATTCCAAT CAATTCATTT TGATAAGCCA AGAATAAGAG 300 

ATTGTAGTGA CTCTTGTACT CTTCAATTAA AGCATTTTGT TCTGAACTGA TATGAATCTG 360 

CTCATCCTGC ATCAAGACAT AATTCCCAAT AAGAACTGGT TGGCCATCTA TATGAGATTT 420 

GATCCCCTTG CTTGCGATAT ATTGGAGTTT CCCATGCATT TCCTCATGTT CAATTCCCTC 480 

TATCTCAGCT TGCTTGACGA TGGCATTAGC AATAGGATGA TAAATGTGTT CCTCAAGACA 540 

GGCACTGATT CTGAGAATAT CTTCCTCACT ATAGTCTCCA AAAGGTAACA CCTTTTCAAC 600 

TATAGGATAA CTAGTTGTGA TTGTTCCTGT CTTATCAAAC AAGAAAGTAT CAACTTCCAG 660 

ATATTTCTCC AGAACATCTC CATCCTTAAT CACCATTTCA CGGTTCAACC CTTCCTTGAT 720 

AACTGTCAAA TAAGCTACAG GAGTAGAGAT TTTCAAAGCG CAGGAGAAAT CGACCAATAG 780 

GAAAGAAATA GCCTTAGAAA AAGAACCTGT CAATAGGTAA GTCAGCCCAG CCCCCAAGAA 840 

ATTATATTTG ACGACTTTAT CCGCCATCTT GATGAAATAG CGTTGTTTCG TTTTCTTGTT 900 

TTCTTCAGAT TTCTTCATCA ACTCAATCAG CTGTAAAATA CGGCTGTTCA TCTGATTATC 960 

TGTTACACGA ATGCGTAACT CTCCAGTTTC TAATACTGTA TTTGCACAAA CCAAATCAGA 1020 

CTCTCTTTTT TCAACTGGAA AACTCTCTCC TGTCAAGGAA CTTTCGTTGA CCATACCTAA 1080 

ACCTGAAACT ACTTGTCCAT CAAACAGAAT TTCATTTCCT TGAGATAAGA TCAAGACATC 1140 

TCCTATTTGA ACATCGGAAC TCTTGATACT AACAACCGTA TCGCCCTGTA CTAGGAATAC 1200 

ATCGCTCTCT TTTGCAAGAA GACTCTGTTC TAAATCTGTT GCAGTTTTTT TCAAGGACCA 1260 

CTGATCTAAA TGATTCCCCA AATCAAGCAT AAACATGATA TTGCTAGCTG TCTTGGATTG 1320 

GTTCATAAAC AAAGACAATA AAATAGCCGA ACAGTCCAAG ACTTCCATCG TTAGTyCCTT 1380 

ACGCGCTAGT GTTTGATAGG CTTCTCTAAT ATAACCCAAA GCCTGATAAC AAGTCCATAT 1440 

ATAGCGAATA GGATACGGCA CAAAACTACG AAAAAGTACA CGCTTAACCG CTGCACCTGA 1500 

AACAATAGAA TAAGCACTCT CTTCTCTACG AATGGGAAGA GTCATCAACT CAGAAACTTT 1560 

CCCTTTATCA ATTCTTTTTA AAAAGGCTTC TGCATTATCT AATACAGAAA AGCCTTCTTT 1620 

TATGCGTAGA GTAAAGTGCT GTTGATCCAT GTAAAACTGG ATAGACTCAA TCCCCTTTTC 1680 

ATCTCTCGCC AAGGAACGAA GATAGTCTTG AATATCCAAG GTAAGTGAAA AAGAAGATGA 1740 

TAGTCGGATA TGTTGGTATC CTCTATGTAG CACTTTAAAA GACATATTAT TCACCTATAA 1800 

GGCTATCTAA TTGCTCTTCT TTTTTCTCTT GCTCGTACAA ATATTTGGCA TCTTGCAAGA 1860 

CATCGTCTCC ATGTTGCTTC ACAACAGAAA CAGATGCATC TAGCTCGTCT TTCAACTTGT 1920 

AAGCCTTAGC CAAAGCTTTA GAATAACCTT TTTTAGCTTC CTTACTTGCT AAGATTTTCA 1980 
AACCAAGGGT ACCAAATGCG ACACCACCCA AAAATAATGA AGATTTTTTC GCAACTTTTG 2040

CAACGGTTAA TACTTCTTTT AACATAGGG 2069
(2) INFORMATION FOR SEQ ID NO: 175: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4597 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 175: 

AAATCTTGCG CAATAAAGCT CATCTCCATC TCCCGATTGA AACAGTCACT CCCCGGACTG 60 

TTTCAACGTC CCAAGACATA ATCTTAGGCA GATTTCTAAA ATTACACTCA AAGTGGAAGT 120 

CATTGAGCTT TCGAATGACA GTTGAAGTTG AAATGGCCAG CTGATGGGCA ATATCGGTCA 180 

TAGAAATCTT TTCAATTAAC TTTTGCGCAA TCTTTTGGTT GATAATACGA GGAATTTGGT 240 

GATTTTTCTT GACGATAGAA GTTTCAGCGA CCATCATTTT CAAGCAATGA TAGCACTTAA 300 

AACGACGTTT TCTAAGGAGA ATTCTAGTAG GCATACCAGT CGTTTCAAGG TAAGGAATTT 360 

TATAGGGTCT TTAATGTCTA GTAATTTTGT GATAAAATGT AATTGTTCCA TATGATTCTT 420 

TCTAATGAGT TGTTTTGTCG CTTTTCATTA TAGATCTTAT GGGACTTTTT TTCTACCCAA 480 

AATAGGCTCC ATAATATCCA TAGGGAATTT ACCCACTACA AATATTATAG AGCCCAAAGT 540 

TTTAGGTCGC TTGATAATAT GCGTTTTTTG AATTTTATAG ACTGCTCGTT TAAACTCTAT 600 

TTACTTCGTA CCTTCTGGAG CGAGACGGAA TATTAGTCAC ATACAAAATG AGTACTATTA 660 

GGATTTTATT TTCATGTACA ATTTCAGCCA GTCTTGTTAT AATCAGCCTA TAGGAATCAA 720

GGAGGTGACT CTTATGGCTG TTTTTGTGTC TTTGGATGGA ATTGTGGTAG AAGTCCTTGA 780 

TGTCTTTTCT TCTTTTAATG GGGATAGTGA GTTTTTCTTG TGTATAGCAT TTTGAATCTG 840 

GAATAGGACG CCATGACTGC TAAAAGATTT CTATAAATTA ATTTGATTTT CCTAATCAAT 900 

TTGTTCATAT CTTATTTCAT TCCACTATAA ACGTCTTAAA GACAAGAGTC AGTTTGTTAT 960 

GGAACGCTCT CAGTTCGAGG AGATGTTCCA ACTTCAAAGT AGTCGCTTGA CGACGCAAGA 1020 

AAAATTACAA TTGTTTACCT CTGTGTTTGC TGGCCGTTAT GATGTTTATG CTAAGAATTT 1080 

TATCAATGAA CAAGGGAAAA TTCAGTATTT TCCTTCCTAT GATTATGGTT GGAAGCAGTT 1140 

GCCACCTGAA AAACGGAGTT TCCAGACATT GACGAACTCC GTTTTGAAAT CTCATTTTCG 1200 

TGGGGAGGCA GCTATCGGTA TCTTTCCTAT GCACTTAGAT GATAGCTGTT ATTTTTTGGT 1260 
ACTGGATTTG GATGAAGGAG ATTGGAAAGA AGCTGGTTTA ACCATTCGAA GAATAGCCAG 1320

GGAACGCCAG ATGGAAGCCC ATTTAGAGAT TTCTCGTTCG GGTCACGGAC TCCATATTTG 1380 

GTTCTTCTTT GAGGAAGCGA TTCCGAGTCG AGAGGCTCGC TTGTTTGGAA AGAAACTGAT 1440 

AGAACTGGCA ATGCAGGAAA GTATGCAACT GTCCTTTGAT TCTTTTGATC GCATGTTTCC 1500 

AAATCAGGAT GTCCTTCCTA AGGGGGGATT TGGAAATTTG ATTGCCTTGC CTTTTCAAGG 1560 

AGAAGCTTAC CATCAAGGGC GAACGGTCTT TGTGGATGAA CAGTTTCAGC CTTATGAAGA 1620 

CCAATGGAGG TATCTACAAG AAATTCAGAG GATTTCAACT GCTAAAGTGG CACTGTTAAT 1680 

CCAAGAGGAG TTAGGCAAGC AAGAATTGGA TAAGGAGTTG AAGGTCGTTT TATCCAATAT 1740 

GATCCAACTT GAAAAATCGT CTGTGACATC CAAGGCACTT TTTTCTTGAA AAATATGGCT 1800 

TCCTTTTCTA ATCCCGAATT TTATAGTAGA TTGAAACTAG AATAGTACAC CTCTGCTTCT 1860 

AAAACATTGT TAGAAATCGA TTTGACTTTC CTGATCGATT TGTCCTGTTA TTATTTCATT 1920 

TTACTATATT TAAAGCAGGC TATGCGACAG CCAACCTATC AAATTCCTGA GAGAATGTAT 1980 

TTATTTGGAG AATCCGATCA TTATTTATGG TTGCCAAGAG GTTTGCTGTA TCCATTGCAA 2040 

GATAAATTTA AGCAGGTATC TGTGGAAGAT AGGAGAAAGG TACAAAGGTC TATTAGCGTG 2100 

GAATTTAAGG GAGAACTCAC TTTTGAGCAA GAGTTAGCCC TGTCAGATAT GACTTCTAAA 2160 

GAAAATGGTT TACTTCATGC GGAGACTGGT TTTGGGAAGA CCGTTTTAGG TGCTGCTCTT 2220 

ATCTCTGAAC GGAAAACAAA AACAATTATT CTAGTCCATA ATAAGCAACT CTTAGACCAA 2280 

TGGCTAGATC GCTTAAACTG CTTTTTGACT TTCGAAGAGG AGGAGGCTAT CCGTTATACG 2340 

GCATCAGGTC GTGAAAAGGT AATCGGCTAT GTTGGGCAGT ACGGTGGGAC TAAGAAATGG 2400 

CTGAGTAAAC TGGTTGATGT CGTTATGATT CAATCTCTAT TTAAGTTGGA AAATAGTCAA 2460 

AGTCTTTTGG ATGAGTATGA GATGATGATT GTGGATGAGT GTCATCATGT CTCTGCCTTG 2520 

ATGTTTGAAA AAGTTGTTGC TCAGTTTAGA GGGAAGTATC TTTACGGTTT GACGGCTACG 2580 

CCTGAGCGTA AGAATGGTCA TGAGCCTATT GTTTTTCAGA GAATTGGTGA GATACTCCAT 2640 

ACTGCTGATA AGAGGGAAAC GGATTTTAAA CGGCAATTGC AATTAAGATT CACTTCTTTT 2700 

GGTCATTTGG AAATTGAAAA GACCAAAGCA AGTAATTTTA TACAGCTTAG TGATTGGATT 2760 

GCTACTGACT CAGTGAGGAA TCAGATGATT CTCAAGGATA TTCTAGCCCA AGTGGCAGAA 2820 

GGACGGAATA TCTTGGTTTT AGTTAATCGA ATTCAACAGA TAGATGTCTT TGAAAAATTA 2880 

TTGAAAGAGA AAGAGGTTGA TGACTGTTAC ATTATTAGCG GAAAAACCAA AGTCCGAGAG 2940 

AGAACGAGTT TACTGGAGAC GTTAGAACAG TTAGATAAAG GGTTTGTTTT GTTGTCTACT 3000 

GGAAAATACA TTGGCGAAGG TTTTGACTTA CCTCAGTTGG ACACGCTTAT CTTGGCAGCA 3060 
CCCTTTTCTT GGAAAAATAA TTTGATTCAG TATGCAGGTC GGATTCATAG AAACTACAAG 3120

GATAAGTCTT TGGTGCGTAT TTTCGATTAT GTGGATATTC ATGTTCCTTA TTTAGAAAAG 3180

ATGTTTCAGA AACGACAAGT AGCTTATCGA AAGATGGATT ATCGTGTCAT CGAGGGTGAG 3240

GAGAAACAAT TCGTTTATGT TGATAGTAGA TATGAGAAGG TGTTGAGAGA GGACTTAGCA 3300

GGGGAAAGAC AGGAATGTCT GCTTATTTTA CCTTATGTGC ACCAGACAAA ACTGATGAAT 3360

TTTCTAAAAG AATTTAGGAT TAGTCAAATT GAGATATGTA TACCAGAGAC GGTTGCAAAT 3420

AAAGCATGGC TAGACCAGTT GAAGAGCCAG AAAATTAAAG TGTCTTTTAC TCAATCAAAA 3480

ATAGTAACGC CTATTCTTTT GGTGAATAAG ACTATTGTTT GGTATGGTGC AATGCCATTA 3540

TTAGGGAAGG TAGATGAGAT GACCATATTA CGTTTGGAAT CAGCTAGTAT AGTTTCTGAA 3600

CTAGTGGCAG GTTTACGATA GAGAAAATTT TTAAAAATTT CTATGTATGA TTTTCATTTC 3660

TTTAGTGAGA CTGTTGCCAT TATCACATTC GAATCACACA AAATAAAAAA ATTTTTATAA 3720

GTACTTGACA AATAGATTGA AATATCATAA AATAAAAACG GTTACAGAGT TATTAATTAT 3780

TTAAGCTTCA TGTCACCATT AAAAATTGAA ATAAAAGGAT GTTATCACTA ATACAAGTGA 3840

GCAGGAACCT ATTTAATCAC ATCAGAAGAA GTTTCTTGAT GTTTTTAAGT AGGTTCCTTT 3900

TATTTTAAAA GGGAAATTTT ATGATCATAA AACGAATACT AAACCACAAT GCCGTAATTG 3960

CGCAAAGTAA AAAAGATATC GATATTCTTC TTTTTGGAAG GGGAATAGCT TTTGGAAGAA 4020

AAACTGGAGA TAAAGTAAAT CCAATTGATA TTGAGAAAAG TTTTTTTCTC AAAAATAGAG 4080

ATAATATGAC CCGTTTTACA GAGATGTTTA TTAACGTTCC TTTGGAGTTG GTGTACATCA 4140

CCGAAAAAAT AATTAACCTA GGTAAAATAA CATTGGGTAA TAATTTTGAT GAAATTATCT 4200

ATATTAATTT AACGGATCAT ATTTCTTCGA GCATAGAACG TTATAAAGAA GGGATTATTA 4260

TTTCGAATCC CCTACGCTGG GAAATATCGA AATATTATAA AGAAGAATTT GAACTTGGGA 4320

AAAGGGCTTT ACAAATAATA AAAAAAGAGT TAGGTATTGA ACTTCCAATT GACGAAGCTG 4380

CATTCATAGC GCTACATTTT GTTAATGCTA ATTTAGAAAA TAATTTTCAA GAGTCGTATA 4440

AAATCACTGA AATAATTATG GGAATTGAGA AAATCATTCA AGATTTCTAT TGTACTGAGT 4500

TTAACCAAGA TTCTATTGAT TATTATAGAT TCATAACTCA TATGAAATTA TTTGCCCATC 4560

GCTTGGTTGA GAATACAACT TATTGTGACG ATGATGA 4597
(2) INFORMATION FOR SEQ ID NO: 176:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 3984 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 176: 

CGGCTTATTT ACTACTTGTT CCATCATATA TGGAATATGC ATGAAACCTG CTCTCATATT 60 

AGGGAATTTT TTATCCACTA AATAAAGAGC TTGGTACATC AAATGATTGC AAACAAAGGT 120 

TCCTGCACTA TTGGATACAA CTGCCGGAAG TCCCTGTTTT TTGATAGCTT GTACCATCGC 180 

TTTGATAGGT AAACTACTAA AATAGGCCGA TGCTCCATCA ATACGAATCG GTGTATCAAT 240 

TGGTTGATTG CCTTCGTTAT CAGGTATGCG AGCATCATCT TGATTAATAG CCACTCGTTC 300 

AGGTGTTAAG CCGGTCCTGC CGCCTGCTTG TCCAATACAA AGTACAGCAT CTGGTTGATA 360 

TCGTAATATT TCTGCCTCTA AAACTTCTGA CGACTTATAA AAAACCGTTG GAATTTCTAC 420 

CCAGCGAACT TCAGCCCCAT TAATCTCAGA TGGTAATAAT TTTACAGCCT CCAAAGCTGG 480

ATTAATCTTT TCACCTCCAA AAGGATTAAA ACCTGTAACC AATATTTTCA TTTTATTTTC 540 

CTTTACTAAA ATGCGAGAAA GTACATTAAG AATATGTGAA TAACAATCAT TACTAGAGCA 600 

ACACCTGCTT GAGCCTTTAT AACGCCATTC TGATCTTTCA TATCCATCAA TGCTGCTGGT 660 

AGAGCGTTAA AATTAGCAGC CATTGGGGTC AATAAGGTCC CACAATAACC TGCTGTCATG 720 

GCAAGAGCAC CAGCCACAAT TGGATTAGCT CCCAGAGCAA ATACAAAGGG AACTCCAACA 780 

CCTGCTGTAA TAACGGTGAA TGCTGCAAAA GCATTTCCCA TAATCATTGT GAATAGAACC 840 

ATTCCAAGAA CATAGGCCAA AACTCCTATA AAGCGACTAT CTGAAGGAAC AATACCGCTA 900 

ATCAGATGAG AGATAACATC ACCAACACCT GCTACAGTAA AAATAGCCCC CAAAGCCCCT 960 

AATAATTGAG GAACAATCCC ACTTGTTGAA ACTTGCTGAG TCATTCGATT ATTTTCTGAT 1020 

AACAGACTCT TAGGGTGACT ATTGGTAATC ACAAGAACAG AAATTGTAGC AAACAAGGCG 1080 

GCAAGGCTAA TCGAAATCTT GCTAAATTCT GGAATCATTT GCGCTAAGAC CAACGCAAGT 1140 

ATTGCCATCA GCATAACTGG AATAAAAATT TTATTTTTCA ACCTGTTAGA TTCAATATTG 1200 

GCTTTCATTT CATCTAAGGA TGGCAAGGTT CCGATACGGA CTTGCTTAAA CAATGTTAAC 1260 

AGCGATAATA GGATTACAAT AATACCAATA CTCATATTTG GCATATAGGA ACCACCTATA 1320 

AACGTAATAG ACAATAGAGT CCAAAATGCA GATGTCCCAA GTCGAACTGG GTTTGTTTTA 1380 

TCTTTATAAC TACAATAGGC TGTATGGAGA AATTGACAAC CAATCACAAT ATAGGTCAAC 1440 

TCTAATAGTT GCTTTGCCAA CTCTGTCATT TTTGTTCTCC TCCCCTAGTC TTTTTTGATA 1500 

TCAATTTTTT ATCAAATAAA TAATTATAAA TCCCCACTAC AATAAGTGTT ATAACAGCAA 1560 

CAATAATAGA TGTAGAAGCA ATCCCTGCAT AATTGCTTTC ATAGCCTAAC TGATCTAATG 1620 
TTCCCCCTAT CAAGAGGACT CCCCCAGCAC CTACAAACGT ATTTTGAGCA AAGAAATTTC 1680

CAAAATTTTC ATTCGCAGCC GCACGCGCTT TTATTGTCTC ATCTTCAACC TCTGTTAACT 1740

TTCTACCTAA TTGAGACTCT GCAGCTGCTT CTCCCATAGG TTGAACCAAA GGTCTGACAA 1800

ACTGAGGGTG TCCTCCTAGA CGAATTGAAA AGAAACCAGC TAACTCTCGA ATAAAGAAAT 1860

AAACTGTATA GAAGTTTCCA ACTGTCAGAC CTTTAATCTT TCGAATCAAA TCGATTGATC 1920

GTTGCTTGAG TCCAAAGGTT TCTGACAGCC CCACAAGAGG CAAGGTAACC ATAAAAATCG 1980

TGAGCACTCG CTGATTGCTA AATTCTTTTC CCAAAATCTC CAAAAATTCA ACGAGAGAAA 2040

CACCTGAAAC TAAAGCTGTA ACCAAACCAG CTAAGACTAC TGTTGCAATT GTATCAAATT 2100

TTAAAATAAA ACCCACAACA ATGATTGCTA TTCCTATTAA TCTAATCCAC TCCATATCAA 2160

ACTCCTTTAT ATTCAAAATG ACAGTATTTT TAAAATTTTA TCAAGATCAA TACCATTCCT 2220

TATTTAATGT GTTTTTCTAG TTCTTTTTGG TATTTGCTAT TGGATTCCAA TTTTTCTTTT 2280

TGCCATTTTT TAAAAACCTC GTTATATTCT TTTGTTGTAA CAATATCTTT TTGCAATTTC 2340

ATTCCTTTAA AGATATATGG ATCCCCCTTA ATACCAACTT GTGAGTATGG TTTTGAGAAT 2400

GGTACTACGT TACTTACAAC TGGAGAACCA CCAGATGAAG CTGTTGGCAT CAATAATGAA 2460

CTATCTGTCG ACCAAGCTTG AGCTTTGGCA TATTTTTCAT ATCTTTTCTC TAGGTCAGTG 2520

GTCTCAGAAA CAGCATCTTC TAACAATTTC TTATATTTAT CCAAACCAGG TTTAGCTACA 2580

ACATCCTTAT CTTTTCCTTT CGTAATACCA AGGTGTTTCA TGGCAGAACC AGATTTTGGA 2640

TCTATAATAT TCAAGTGAGA CGCTGGATCT TGATAGCTTG GAGCCCATCC TGTACTGTTC 2700

AAATCATAGT CTTTTTGAGA AGGAGCAACA TTGCCGTATT TATCATTTTC CATCAAACCA 2760

TCAATAACAT TTCCAATAAC GTCTGTCCTC GATGTTCGAG TCGCTATACT GTAGCCCAAT 2820

GATGCTGGAT CTACTGCATA GACATAAGAA AATGTTGTCG GTGCATCTGC TTCTTTATCA 2880

GTTTTTCCAC AAGCCACTAA AATAGCTGAC GTGCTCAGGA CCACTCCTGC TGTTAAGAGC 2940

CACTTTTTCT ATTTCATAAA GAATCTCCTT TGGTTTATTT TAATCTACTT TTACAATCCA 3000

ACCTTCTGGC GCTTCAATAT CGCCAAACTG AATACCCGTC AATTCATTAT ATAATTTACG 3060

CGTCACAGGA CCTACTTCTG TTTCACTATA GAATACATGG AAATCATCAC CATGTTGAAT 3120

ACCTCCAATT GGAGAAATAA CCGCTGCTGT ACCACAGGCA CCTGCCTCTA CAAAACGGTC 3180

AAGATTATCA ATTGGAACAT CACCCTCAAT AGGAGTTAAT CCCAAGCGAT GTTCTGCCAA 3240

ATAAAGCAAG GAATACTTGG TAATAGATGG CAAGATAGAT GGACTCAATG GTGTTACAAA 3300

TTCATTATCA GCTGTAATTC CAAAGAAGTT AGCTGATCCG ACTTCTTCAA TCTTTGTATG 3360

AGTTGATGGG TCCAGATAGA TAACATCTGA GAAATGACGT GACTTGGCCA TTTTTCCTGG 3420

TAAGAGACTT GCAGCATAGT TTCCACCAAC CTTAGCCGCA CCTGTACCAT TTGGTGCTGC 3480 

ACGGTCGTAC TCATCCTGAA TCAAGAAGTT GGTTGGGACC AAACCACCTT TAAAGTAATT 3540 

TCCAACTGGC ATAGCAAAGA TGGTGAAAAT GTACTCTTCT GCCGGTTTTA CCCCGATAAT 3600 

ATCTCCGACA CCAATCAAAA GAGGGCGAAG ATATAAGGTT CCACCTGTTC CGTATGGTGG 3660 

TACGTATTCT TCATTCGCAC GGACAACTGC TTTACAAGCT TCTACAAACA TGTCTGTCGG 3720 

AACTTGTGGC ATCAAGAGAC GGTCACATGT ACGTTGCAGA CGTTTAGCAT TTTCATCAGG 3780 

ACGGAACAGT TGAACACTGC CATCCTTAGT ACGATAAGCT TTCAAACCTT CAAATGCTTG 3840 

TTGTCCATAG TGAAGACTTG GAGAAGACTC TGAAATATGC AAAGTTGCAT CCTCTGTAAG 3900 

CTCTCCTTGA TCCCATTGTC CATTTTTGAA ATGAGCAAGA TAGCGATAAG GTAATTTCAT 3960

ATAGGAAAAA CCGAGGTTTT CCGG 3984
(2) INFORMATION FOR SEQ ID NO: 177:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 8703 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 177: 

TATCTAATTA TTGGTTTTTA TCGCTGACCT TGGCTATTGT TGGGGTTGTT TTACCCTTGT 60 

TGCCTACAAC ACCTTTCCTT TTGTTGTCTA TTGCTTGTTT CTCCAGAAGT TCCAAGCGAT 120 

TCGAAGATTG GCTTTATCAT ACCAAGCTCT ATCAAGCATA TGTAGCTGAT TTTCGTGAGA 180 

CCAAGTCTAT TGCGCGTGAA CGAAAGAAAA AAATCATCGT CTCTATCTAC GTCTTGATGG 240 

GAATTTCTAT TTATTTTGCA CCTCTTTTAC CAGTCAAAAT CGGTCTGGGT GCTTTGACCA 300 

TCTTTATTAC TTATTATCTC TTCAAGGTCA TTCCAGACAA AGAATAGTTA AAACAGTAGT 360 

TATTTGCCTT GATAAAATTG AAAGCATATT CATAACAATA TGATATAATA AAATTGAAGT 420 

AATATTCAAG GAGAATCAAA TGATTTACGA ATTTTGTGCT GAAAATGTGA CTTTACTTGA 480 

AAAAGCGATG CAGGCTGGAG CTCGTCGGAT TGAACTCTGT GATAATCTAG CAGTTGGTGG 540 

GACAACACCC AGCTATGGAG TGACTAAGGC AGCGGTTGAA CTGGCAGCTA ACTACGATAC 600 

AACCATCATG ACCATGATTC GGCCACGTGG TGGTGACTTT GTCTATAATG ACCTAGAAAT 660 

TGCTATCATG CTAGAAGACA TTCGTTTGAC TGCTCAGGCT GGAAGTCAAG GGGTTGTATT 720 

TGGAGCTTTA ACTGCTGATA AAAAGTTGGA TAAGCCTAAT CTGGAAAAGT TAATTGCTGC 780 




ATCAAAAGGA ATGGAAATTG TCTTTCACAT GGCCTTTGAT GAACTAAGTG ATGAAGATCA 840 

AGCGGAAGCT ATTGACTGGC TCAGTCAAGC CGGTGTCACT CGTATCCTAA CTCGTGCTGG 900 

TGTGTCTGGC GACTCCTTAG AAAAACGTTT TGTTCACTAT CACAGAATTT TGGAGTACGC 960 

TAAAGGTAAA ATTGAAATTC TACCAGGTGG GGGGATTGAC CTTGAAAACC GTCAAACCTT 1020 

TATCGACCAG GTGGGGGTAA CACAATTGCA TGGTACTAAG GTTGTTTTTT AAAAAATAGA 1080 

AAGGAACTGC TAGCTTTGGG TAGCAGTTTT CACTTATGTT TGAAATTTTT AAATCCTATC 1140 

AATTTAATCA AGAAAAGGCT CATGATTATG GTTTTATAGA AAATAGCGAA GTCTGGACAT 1200 

ATAGTTGCCA GATTTTGCAA GGTGACTTTG TCATGACTGT GTCCATCACT GCTGATAATG 1260 

TGAACTTTCA AGTCTTTGAC CAAGAGACTG GTGACCTCTA TCCTCACGTT TATATGGAAA 1320 

GCATGAGGGG AAGTTTTGTC GGAAATGTCC GTGAGGCTTG TCTGGAGATT CTTTACCAGA 1380 

TTCGGAAGGC TTGTTTTGAT GTGCAAGATT TTATCTGTCA TCAGACTAAG CGTATCATGA 1440 

CTCAAGTTCA GGAAAAGTAT GGAAACCAGT TGGAGTATCT GTGGGAAAAA TCGCCTGATA 1500 

CAGCTGTATT GCGCCATGAA GGCAATCAAA AGTGGTATGC CGTCTTGATG AAAATCTCTT 1560 

GGAATAAGCT GGAAAAGGGC AGAGAAGGAC AAGTGGAAGC AGTCAACCTC AAGCATGACC 1620 

AAGTAGCTAA TTTGCTTTCA CAAAAGGGGA TTTATCCAGC CTTCCATATG AGCAAGCGCT 1680 

ACTGGATTAG TGTGTCCCTT GATGATACTT TATCAGATGA AGAAGTACTG GAATTGATAG 1740 

AAAAAAGTTG GAACTTAACC TCTAAAAAAT GAAATATTTT AATAATTTTC ATGAACTTTC 1800 

AATTAGCTAA ATATTCTTTA CTGAAGAGAT TTTTAGAAAA TATAGGATTT ACCACACTAG 1860 

AGGAATATGG TGCCATCTTC AAATACCTGA TTGAGAATGT CAAGACGGAT CGTCAGATCA 1920 

TCTATTCGCC TCACTGTCAT GATGACCTCG GAATGGCAGT GGCAAATAGC CTTGCTGCTG 1980 

TCAAGAATGG TGCAGGACGT GTTGAAGGGA CTATCAATGG TATTAGGGAG CGAGCTGAAA 2040 

ATGCTGCTTT GGAAGAAATT GCAGTGGCTC TCAATATTCG CCAAGATTAC TACCAAGTAG 2100 

AAACCAGTAT TGTCCTAAAT GAGACCATCA ATACGTCAGA AATGGTTTCT CGCTTCTCTG 2160 

GTATTCCAGT TCCTAAAAAC AAAGCCGTCG TTGGTGGCAA TACCTTCTCC CACGAATCTG 2220 

GTATTCACCA AGATGGAGTC CTTAAAAATC CTCTCACTTA TGAGATCATC ACACCTGAAT 2280 

TGGTTGGTGT TAAGATTCTG CTTGGAAAAT TATCTGGTCG CCATGCTTTT GTTGAGAAAC 2340 

TGAGAGAATT GGCCCTAGAT TTTACAGAAG AGGATATCAA ACCACTCTTT GCTAAGTTCA 2400 

AGGCACTGGT CGATAAGAAG CAAGAAATCA CAGATGCAGA TATTCGAGCT TTGGTAGCTG 2460 

GAACCATGGT TGAAAATCCA GAAGGCTTCC ACTTTGATGA TTTACAACTT CAAACTCATG 2520 
[illegible] [illegible]GAAGCG CTCGTTAGCC TAGCCAATAT GGATGGTGAG AAAGTCGAAT 2580

[illegible] AGGGCAAGGT TCCGTTGAAG CAATCTTTAA TGCTATCGAT AAGTTCTTTA 2640

[illegible] [illegible]GTG TCCTACACTA TCGATGCGGT AACAGATGGA ATCGATACCC 2700

AGGATCGGGT tttggtcact GTTGAAAACA GAGATACAGA AACCATCTTT AATGCAGCAG 2760

GGCTTGATTT TGATGTGTTG AAGGCTTCTG CTATTGTCTA TATAAACGCT AATACCTTTG 2820

TTCAAAAAGA GAATGCAGGT GAGATGGGAC GCAGTGTTTC TTACCACGAT ATGCCTAGTG 2880

[illegible] aaggctatgg CAAAGAAAAT AGTAGCTCTA GCAGGAGACG GAATTGGCCC 2940

[illegible] [illegible]AGGTTGGTT TAGAAGTTCT GGAGGCTCTA GCTGAAAAAA CAGGTTTTGA 3000

ctatgagatt gacagacgac CGTTCGGAGG TGCAGATATT GATGCAGCAT GACCTCCCTT 3060

acctgatgaa acccttaagg CAAGTAGGGA AGCAGATGCT ATCCTACTAG TAGCTATCGG 3120

tagtcctcag tatgatggag CAGTGGTTCG CCCTGAACAA GGCCTGATGG CTCTCCGTAA 3180

ggaactcaat ctttacgcta ATATTCGTCC TGTAAAAATC TTTGACAGTC TCAAGCATTT 3240

GTCACCACTC AAACTGGAAC GAATTGCTGG TGTAGACTTT GTCGTGGTGC GTGAATTGAC 3300

aggcgggatt tactttggat ATCATATTCT TGAAGAGCGC AATGCGCGTG ATATCAACGA 3360

CTATAGCTAT GAGGAAGTGG AGCGGATTAT TCGCAAAGCC TTTGAAATTG CAAGAAATCG 3420

CAGAAAAATC GTTACTAGTA TCGATAAGCA AAATGTTCTA GCGACCTCAA AACTCTGGCG 3480

GAAAGTAGCT GAGGAAGTCG CACAGGATTT CCCAGATGTA ACCTTGGAAC ATCAGCTGGT 3540

AGACTCAGCT GCTATGCTTA TGATTACCAA TCCTGCTAAG TTTGATGTTA TTGTAACGGA 3600

GAATCTTTTT GGAGATATTT TATCTGATGA ATCAAGCGTC TTATCTGGTA CACTTGGGGT 3660

TATGCCATCA GCCAGTCATT CTGAAAATGG ACCAAGTCTC TATGAACCTA TTCACGGTTC 3720

AGCACCTGAT ATTGCAGGTC AAGGAATTGC CAATCCTATT TCCATGATTT TATCAGTTTC 3780

CATGATGTTG AGAGATAGTT TCGGACGTTA TGAGGATGCA GAGCGTATCA AACGTGCTGT 3840

TGAGACAAGT CTGGCGGCAG GAATTTTAAC GAGAGATATA GGAGGTCAGG CTTCAACAAA 3900

GGAAATGACG GAAGCTATTA TTGCAAGGTT ATGAAGTTAG ACGAAAAAAT TACTCTAGTC 3960

CTTTTGATTT GGAATGTCAT CATTTTCTTG ATTTATGGTA TTGACAAATC TAAGGCAAGG 4020

AGAAGAGTTT GGCGCATCCC TGAGAAAATC TTACTTATTT TAGCCTTTAC TTTTGGTGGT 4080

TTTGGTGCCT GGCTAGCAGG AATCATCTTT CACCACAAGA CTCGAAAATG GTACTTTAAA 4140

ATAGTTTGGT TTCTTGGGAT GGTGACCACA CTAGTAGCCT TATATTTTAT TTGGAGGTAA 4200

TGGATGGCAG GGTCTTCGAG GGAATACGCT GCTTGGGCTC TAGCGGACTA TGGTTTTAAG 4260

GTCGTGATTG CAGGATCTTT CGGTGACATT CATTACAATA ATGAACTCAA TAATGGCATG 4320

TTGCCAATCG TTCAGCCTAG AGAGGTTAGA GAGAAACTAG CCCAGCTAAA ACCAACCGAC 4380

CAGGTAACTG TGGACTTGGA ACAACAAAAA ATCATCTCAC CAGTTGAAGA ATTCACCTTC 4440 

GAGATAGATA GCGAGTGGAA ACATAAACTC CTAAATAGTT TGGATGATAT CGGTATTACC 4500 

TTGCAGTATG AAGAGTTGAT TGCTGCTTAT GAAAAACAAC GACCAGCCTA CTGGCAGGAT 4560 

TAGAAAAAAT AGAAAAGGAG ATATAGTAAA CTGAAATAAG ATGTAAACAA ATGAATTGGA 4620

GCTTAACATC CATTTCCAGC AATTTTTTAG AAACTACAGT GGACTATTCT GGATTCAACA 4680 

CATTATAAAA TTATGACAAA ACACATTCAC AAGAAGGCTA CGACATTTTA AAAGGTGAGG 4740 

GCGGATGTAT CGTTTGCCCT ACTAAAGTTG GTTACATTAT CATGACCAGT GACAAGGCAG 4800 

GACTTGAGCG TAAGTTCGCA GCCAAAGAAC GTAAGCGTAA CAAACCAGGT GTTGTTCTCT 4860 

GCGGTAGCAT GGATGAACTT TGCGCTTTAG CGCAACTCAA CCCAGAAATT GAAGCATTCT 4920 

ACTAAAAACA TTGGGATGAA GATATTCTTC TTGGTTGTAT CCTTCCTTGG AAACCAGAAG 4980 

CCTTTGAAAA ACTCAAAGCA TACGGGGATG GCCGTGAAGA ACTTATTACT GATGTACGTG 5040 

GTACTAGCTG TTTTGTTATC AAGTTTGGAA AAGCAGGTGA ACAATTGGCT GCCAAGCTTT 5100 

GGGAAGAAGG TAAAATGGTC TACGCCTCAT CTGCTTCAAT GACAAAACGA TTGAAACTCG 5160 

CTATGAGCAA GGTGTAATGG TGTCTATGGT CGATAAGGAC GGCAAACTCA TCCCAGAACA 5220 

AGGAGGAGCA CGTTCAACTT CACCAGCTCC AGTTGTGATC CGTAAAGGGC TTGACATTGA 5280 

TAAAATCATG ATGCACCTGT CAGATACTTT TAACTCATGG GACTACCGTC AGGTTGAGTA 5340 

TTATTAGGAT AGAGAAGAAG TCTAGTGTTA TGAGATATTA AAGCTCCTAA CACTGGGCTT 5400 

TTGTTTAGAA TTTCTTTTCT TTTTCTATAG GATATGGTAT TCTATGTAGA AAATATATGT 5460 

TAATAAGTAA TGCCAATATT TAAACATCAT TAGTAAAAGG AGTTAGATTG ATGAATAAAA 5520 

GAAAAGTTAG TTTAGAAGAT TTTTATAAAT GGTATAGTCT AAATAAAGAA GAGTTATTAA 5580 

ATAAGGCAAC TGTTGGTGAA AAGTTTAATG ATAAATTAAA AGAAGAGTTT CTCCAGGAAT 5640 

GGCCTTTGGA TAGGATTTTA ACAATGTCAA TCGATGAATA TGTAATAGGA AAGGGACAGC 5700 

AAAATAAGTC TTTATGCTAC GCTCTTGAGA AGGGAAAATA CAAAAATCTA TTTCTTGGAA 5760 

TTTCTGGTGG CTCAGCTTCA AAATTTGGTA TTTATTGGAA TAAAAAAACA AACAAATATA 5820 

AAGATCAAGC TAATAATGAG ATTTCAGAGT TGGATCAGCG ATTTTCAAAA TTAAAATCAG 5880 

ATTTGTATGA AATTATCAAA GAAGGTATTC GTTTTAACTT TGAAAATCCT ATTTTTGATA 5940 

TGAAAAGATC AACAAATGAA TTTATTGGTC GTTCTGCTAT GGTGACAAAA TTACTTTGTA 6000 

TCTATACTGA GGGAGATCCT TTCTTTGGTG TAAATATTAA TAGTCAGAAA GAATTTTGGA 6060 
ACCACTTTGT TTCTCAGACA AATCAAGGTG GACCTTATCT GCAAAATCAT AAAATAATTG 6120

AACTGGTGTC CAAAACTTAT CCTGAGTTGG AGCCATCGAA ATTAGGAACT ATGCTTTTTG 6180 

AGTATTCTAA GCTTTTTATG GAAAATAAGG AAGACAATAG TACAATGGAT TCATCAAACA 6240 

ATTTTCGTCA TCAATTAACT CAATCTCTAT TAAAGTCTCC AAACCTCATC CTCCGCGGTG 6300 

CTCCTGGCAC GGGAAAAACT TATCTTGCTA AAGAAATTGC TAAAGAATTA ACGGATGGCA 6360 

ACGAAGATCA AATCGGATTT GTACAATTTC ACCCATCATA TGATTATACG GATTTTGTAG 6420 

AAGGTTTAAG ACCAGTATCA AATGGGGATG GAGCTATTGA GTTTAGGCTA CAGGACGGTA 6480 

TTTTTAAAGA TTTTTGTCAG AAAGCAAAAG AAACCCAATT GATTGGAGGA CAAGATAATT 6540 

TTGATGAGGC TTGGGATTCT TACTTAGAAT ATATAAATGT TGCTGAAGAA AAAGAATATA 6600 

TAACAAAAAC ATCTTACTTA TCTGTTAATA GTAGACAAAA TTTGTCAGTA AATTATGATA 6660 

GTGGTGTTCC AGGATGGTCA CTACCTAGCA AATATGTTTA CGAGTTGTAT AAAGATAAAA 6720 

ATTATAATAA GCAAGAATAC TACAAAAGTG GTGGAAAAAC TGTCCTAGAA ACATTGAGAA 6780 

AGAGATTTGG TTTGAAAGAC TATGTTTCCC CAACAGAAAT TGATACTGAT AAGAATTTTG 6840 

TCTTCATCAT CGATGAGATC AATCGTGGGG AGATTTCTAA GATTTTTGGC GAACTCTTTT 6900 

TCTCTATCGA CCCCGGCTAT CGTGGTGAAA AAGGAAGTGT TTCTACCCAA TATGCAAATC 6960 

TACACGAAAC TGATGAAAAG TTCTATATCC CCGAAAATGT TTACATCATC GGAACTATGA 7020 

ATGATATTGA TCGTTCAGTG GATACCTTTG ATTTTGCTAT GCGTCGTCGT TTTCGTTTTG 7080 

TTGAAGTTAC TGTCGAGGGT CAAGCTGGCA TGTTGGATAA AGAGTTGAAT ATCCATGCAG 7140 

AAGAAGCAAA AATTCGTCTA AGAAACTTGA ACGCTGCTAT CGAAAATATT CAGGAATTAA 7200 

ACAGTCATTA TCATATTGGA CCAAGTTATT TTCTTAAGTT GAAGGATGTA GATTTTGACT 7260 

ATGAATTACT CTGGTCTGAT TATATTAAGC CTCTCCTAGA AGACTACTTG CGAGGTTCTT 7320 

ATGATGAGGT TGAAACTTTG GAaACTTTGA AAAAAGCATT TGATCTGACA AATAATGAGC 7380 

AAAAAGATCA GGCAGTAGCT GATGACAATG AAGGCGATGA AAACGATGAT GCGGATTACT 7440

GATAATCAAC ACAAGATTAT TAAAGAAAAA TTTGTTGAAG AATATCCTAA ACTAAGCAAT 7500 

CCTCTTTTAG ACAGAACCTT GGAAAGTCTA TCCCAAGATG AACGTATTTT CATTTTTCCA 7560 

AATGATTwGA CTCATACTCC TGATTTGGAT AAGGACCAAA AGATTTTTGA AACAGTCAAT 7620 

CAGAAAATCA AGACAGGGAA CGTGATTGGT TTTCTTGGAT ATGGTCAGGA AAGATTAACG 7680 

ATTTCCTCAC GATTTTCTGA TGAGAGTAAT GACCACTTTT TGCATTATCT CTTAAACAAG 7740 

GTTCTTCATA TCAATCTCAC TAGTTTAGAT GTTGCTTTGT CTCGTGAAGA GAGGCTTTAT 7800 

CAACTTTTGG TGTATCTCTT TCCCAAGTAT CTACAAGCTG CTATTCGAAA AGGTCTTTAT 7860 
AAGGAATATC ATCGATTTTC TCATAACGAC AGTCATGTTA AGGGAGTGAT TGATGTAAGA 7920

AACCATCTCA AGAAAAATCT TCCTTTCACG GGAAATATTG CCTACGCAAC GAGAGAGTTC 7980

ACCTATGATA ATCCCCTCAT GCAGTTGGTC CGTCACACTA TTGAATACAT TAAGAATCAG 8040

AAAAGCATTG GTCAAGGGGT ACTAGATAAT CTCTCAACTA GTCGTGAAAA CGTATCTGAA 8100

ATCGTGCGTG TAACGCCCTC TTATAAACTA GCTGATCGTG CTAAGATTAT TCGGGGAAAT 8160

CAATCTAAAC CTATACGTCA TGCATACTTT CACGAGTACA GAAACTTACA AGAACTTTGT 8220

CTGATGATCC TAAACCAAGA AAAGCACGGT TTAGGGTATC AAGATCAAAA AATCTATGGT 8280

ATTCTCTTTG ATGTTGCCTG GCTTTGGGAA GAGTATGTTT ACACCTTGTT GCCAAAAGGT 8340

TTTGTACATC CCAGAAATAA GGATAAGACG GATGGAATTT CAGTATTTTC TGTTGGGAAA 8400

CGAAAAGTAT ATCCAGATTT TTATGACAGA GAACGAAAGA TTGTTCTAGA TGCAAAATAT 8460

AAAAAACTGG AATTGACTGA AAAAGGAATC AACCGTGAGG ACTTATTCCA GCTGATTTCC 8520

TATTCTTATA TTTTAAAAGC TGAGAAGGCT GGACTGATTT TTCCTAGTAT GGAGCAGTCA 8580

GTAAATAGTG AAATAGGAAA AGTAGCTGGC TATGGAGCTC AATTGAAGAA GTGGTCTATT 8640

CGAATCCCTC AGAATGCCTC ATTCTATAGT ACATTTTGTA AAATGATGGA AAATTCAGAA 8700

GAG 8703
(2) INFORMATION FOR SEQ ID NO: 178:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 4854 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 178:

CATCACCAGT TTTAGATGGC TTTAACAGTG AAATTATTGC TTTTAATCTT TCTTGTTCGC 60

CTAATTTAGA ACAAGTACAA ACAATGTTGG AACAGGCATT CAAAGAGAAG CACTACGAGA 120

ATACGATTCT CCATAGTGAC CAAGGCTGGC AATATCAACA CGATTCTTAT CATCGGTTCC 180

TAGAGAGTAA GGGAATTCAA GCATCCATGT CACGTAAGGG CAACAGCCAA GACAACGGTA 240

GGATGGAATC TTTCTTTGGC ATTTTAAAAT CCGAAATGTT TTATGGCTAT GAGAAAACAT 300

TTAAATCACT TAACCAATTG GAACAAGCCA TTATAGACTA TATTGATTAT TACAACAATA 360

AGAAAATTAA GATAAAACTA AAAGGACTTA GTCCTGTGCA GTACAGAACT AAATCCTTTG 420

GATAAATTAT TTGTCTAACT GTTTGGGGGC AGTACACAAG AAAGCGCTTT AAAACCAGTA 480

GACCTTTTCA TAAGGTTCGC TTGATGTACC AAGATGAGGC TGGTTTCGGT AGAATCAGTA 540

AACTGGGATC TTGTTGGTCT CCAATAGGAG TAGGTCCACA TGTCCATAGT CACTATATAC 600 

GAGAATTTCG CTATTGTTAT GGAGCTGTTG ATGCCCATAC AGGCGAATCA TTTTTCTTAA 660 

TAGCTGGTGG ATGTAATACT GAGTGGATGA ACGCCTTTTT AGAAGAGCTT TCACAAGCTT 720 

ATCCAGATGA TTATCTTTTA CTCGTTATGG ACAATGCTAT ATGGCATAAA TCAAGTACCT 780 

TAAAGATTCC GACTAATATT GGTTTTACCT TTATTCCTCC ATACACACCA GAGATGAACC 840 

CATTGAACAA GTGTGGAAAG AGATTCGTAA ACGTGGATTT AAGAATAAAG CCTTTCGAAC 900 

TTTGGAAGAT GTCATGAATC AACTCCAAGA TGTCATACAA GGATTGGAGA AGGAGGTGAT 960 

AAAGTCCATC GTTAATCGGA GATGGACTAG AATGCTTTTT GAAAACAGAT GAGTATAAAA 1020 

TTGAATTGCT TATAAAAAAG CTCCATACAC TGGATGTGTA TAGAGCAATG GGGCTTTATT 1080 

TGATATAGAG TTCTTGGTTT TTTAGGACAA TTTCTCGGAT ACTTGCAAAC TTTTTAAGTT 1140 

TTTTGATTTC TTCTGGATGA GTGACGAGAG TGATAACATA ACCTTCCTTG CCCATACGAC 1200 

CAGTACGGCC AGCACGGTGT GTGTAGGTTT CGCTATCTCT AGGAATATCA AAGTTTACGA 1260 

CACATTCTAG GCTATCGATA TCAATTCCAC GAGCCAAAAG GTCAGTTGCA AGAAGCAGGG 1320 

TTAGTTGGTT ATCTTTAAAC TTTTCTAAGA TGATTTTTCT AAATTTAACA TTAACATCAC 1380 

TAGCGAGGGA AACAGCCAAT ATATCACGAT ACTGTAGTTT TTCCTCGGCA TTCCCAAGGT 1440 

CTGACAGGCT ATTGAAGAAG ACTAGACCAC GGAAATCCTC TACATGAGCC AGTTTTCGTA 1500 

GCATATCCAC TCGATGACGT TGGTCTACCT GCATGTAGAA ATGCTGGATA TTGTCCAATT 1560 

TTTGATCAGA GAGATCAATA GTGCGTGTAT TCGGCACAAT CTTTTCTTGG TCAAACTTGG 1620 

TCGTGGCACT CATGTAGACC AGTTGGTGGT CACGAGGTGC GTAGTGAGTG ATTTTTTCTA 1680 

CAAAGTGAAT CTGAGAATCA TCTAGTAATT GGTCAAATTC ATCCAGGATG ATGGTTTCCA 1740 

CATTCATCAT CTTGATTTTT TTAAGTTTAA TGAGTTCAAA GATACGGCCA GGAGTTCCAA 1800

TCAGAATTTC TGGCCCCTTT TTAAGACGTT CAATTTGTCG TTTCTGACTT GAACCTGAAA 1860 

GGAAGAGTTG AGCAGTCAAT CCGATAGCTT CTGCCCACGT TTTACATACA TCAAAAATCT 1920 

GTCCAGCAAG TTCCGTATTT GGTGCTAGAA TCAAGAGTTG TTGGGCTTTT TTCTTTTGTA 1980 

GTCTGAGAAG ACTTGGTAGG AGATACGCTA GGGTCTTACC AGTTCCGGTT TGGCTCACTC 2040 

CTAGGAGGTT TTCTCCAGCA AGAAGGGGCT CAAATAGTTG AGTTTGAATG GGGGTGAATT 2100 

CTTGGAAACC GAGTTGGTCA CTCAGTTCTT GCCATTCAGT CGGTAGTTTG GTTTTCATTT 2160 

TTCTGCCTCA AATCTAATGC CAGCAGTCTG GCGCATGGTA TATAGTAGCT CATGAACAGA 2220 

GCCTGCATCA TACAGCCAAG TTTGGTAGAG ATTCAGATCT GGTTGCTGGA TCATGTGTGC 2280

AAATGCAGCG ACTTCCTCAG TCATCGTATG AGGAGCCTGT TGGATAGGAA GCTGGACTTG 2340

ATTTCCTTGG TGGTCGGTAA AAATAGCTGA GCGAATATGC TCAATCGTGT TGAGAGTCAA 2400

GGTTCCATCT GTTGTATAAA TCTCGCAAGG AAGATTGGAA GTGATGTTTT TTCCAGCCTT 2460

GATGTGAACT TGATAGTCTG GGTAGAAGAG GATACCATCT CCATTTAGGT CAATGCTATT 2520

GTCAAGCTGT TGAGCATGGT AAGTCGCGTC ATTGGCTTTT CCAAAAAGAC GAACAGCAGC 2580

ATAGAGGGGA TAAATCCCCA AATCCATGAG GGCTCCACCA GCAAAACGGT CTGAAAAGAC 2640

ATTTGGTGTT TGTCCAGCCA ACAAGTCAGG CATCTTGGAA GAGTATTTGG CATAGTTGAA 2700

ATCTGCTCCT AACACTTGCT TATCTGCTAA AAAGTTTTTG ATAGTAGTAA AGGCTTTCTC 2760

GTGGTAATTA CGAGCTGCTT CAAAGATAAA ACAGTTATTT TTTTCAGCTG TTTGAATCAA 2820

ATCAAACCAT TCTTGTGGTT GAGAGACAGC TGGCTTTTCG AGAATAACAT GTTTACCAGC 2880

AGACAAGGCA GCTTTTGCCT GAGCAAAATG TAAGGAGTTT GGACTGGCGA TATAGACTAA 2940

ATCAAAAGAA GATTTGAAGA AGACTTCTAA TTGATCGAAT AGTTGGATAT TCTGATAGCG 3000

AGAAGCAAAG GTTGCTGCAG TTTCTAGTTT TCTAGAATAG ATTGCGACCA GTTGGTATTC 3060

TCCACTGGTA TGGGCTGCTT CTATGAAATG ATGGCTGATA GCGCCAGTTC CGATGACACC 3120

TAATTTTAGC ATAAATACTC CTTTTCCGAT TTTAAATCCT TCTTTCATTA TAACATAGAT 3180

AGACGGGACT ATCCAACAGA GAGGAGAAAA TTTCAAATAA GCTATTAGCT TTCTTTTCCG 3240

AATAAATAGA TAGAAGCATA GAATCTAGCA AACCTAGATT TAAAAATGTG CTATAATAGA 3300

AGGAGGAAAA GGAGGATTCT CAGACATCTA GGTATCAGCC CAACTAATGA TTTGTCAATT 3360

TATCCGCGAT ATGCTGGACT TGCCAGCAAA AAATGTGACG ATTTTGGAGG GAAGTAACAT 3420

TCACGTCTTG CCTTCCATGC CCTACTCAGC GTAAGATTTC TATACTAGTA TAGACGTCTT 3480

GGCGGAGTTA GATAATGGAA TCCAAGTTAT CATCGAAATT CAGGTTCATC ATCAGAATTT 3540

TTTCATCAAT CGCCTATGGC CTTATCTGTG CAGTCAGGTT AATCAAAACC TAGAAAAAAT 3600

TCGCCAACGT GAAGGTGATA CCCACCAGAG CTACAAACAA ATCGCACTAG TATACGCTAT 3660

CGCAATTGTC GATAGTAATT ACTTCTCAGA TGACCTAGCT TTTCATAGTT TTATAGTAAA 3720

ATGAAATGAG AACAGGACAA ATCGATCAGG ACAGTCAAAT CGATTTCTAA CAATGTTTTA 3780

GAAGTATAGG TCTACTATTC TAGCTTCAAT CTACTAGAAA TTCCATAGAT AGAAAACTAC 3840

ATAATCTCTA CAGATACGGA TGTTGGAGTT GATGTAAGAT GCTTTGGCTT GCTAGAGGAA 3900

TTGTGGATTG CCAAATTGTA TCATTGAAAT TATTGCTCAA ATTTGTTATG ATATAAATAT 3960

GAATAAAAGT AGACTAGGAC GTGGCAGACA CGGGAAAACG AGACATGTAT TATTGGCTTT 4020

GATTGGTATT TTAGCAATTT CTATTTGCCT ATTAGGCGGA TTTATTGCTT TTAAGATCTA 4080

CCAGCAAAAA AGTTTTGAGC AAAAGATTGA ATCGCTCAAA AAAGAGAAAG ATGATCAATT 4140 

GAGTGAGGGA AATCAGAAGG AGCATTTTCG TCAGGGGCAA GCCGAAGTGA TTGCCTATTA 4200 

TCCTCTCCAA GGGGAGAAAG TGATTTCCTC TGTTAGGGAG CTGATAAATC AAGATGTTAA 4260 

GGACAAGCTA GAAAGTAAGG ACAATCTTGT TTTCTACTAT ACAGAGCAAG AAGAGTCAGG 4320 

TTTAAAGGGA GTCGTTAATC GTAATGTGAC CAAACAAATC TATGATTTAG TTGCTTTTAA 4380 

GATTGAAGAG ACTGAAAAGA CCAGTCTAGG AAAGGTTCAC TTAACAGAAG ATGGGCAACC 4440 

TTTTACACTT GACCAACTGT TTTCAGATGC TAGTAAGGCT AAGGAACAGC TGATAAAAGA 4500 

GTTGACCTCC TTCATAGAGG ATAAAAAAAT AGAGCAAGAC CAGAGTGAGC AGATTGTAAA 4560 

AAACTTCTCT GACCAAGACT TGTCTGCATG GAATTTTGAT TACAAGGATA GTCAGATTAT 4620 

CCTTTATCCA AGTCCTGTGG TTGAAAATTT AGAAGAGATA GCCTTGCCAG TATCTGCTTT 4680 

CTTTGATGTT ATCCAATCTT CGTACTTACT CGAAAAAGAT GCGGCCTTGT ACCAATCTTA 4740 

CTTTGATAAG AAACATCAAA AAGTTGTCGC TCTAACCTTT GATGATGGTC CAAATCCAGC 4800 

AACGACCCCG CAGGTATTAG AGACCCTAGC TAAATATGAT ATTACAAGCG GGGT 4854 
(2) INFORMATION FOR SEQ ID NO: 179:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 2186 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 179:
TAAACAGGTG TTAGGTGCTC TAAACTATTA AAATTCTAAG GAAATAAGGC TACTTTTTCT 60 

GGGTCTTGTT CATAGTAGGT GTGGTTCTTT TTTTCGAGTG TAGCCCATAG CTTTGAGCGC 120 

ATAGTGGATG GTAGTTGGAT GACAGCCAAA TTCAGAAGCT ATTTCAGTCA AAT AAGCA r i C 180 

TGGATTGTCA GTAAGATAGT TTTTAAGTCT ATCTCTATCA ACTTTTCTTG GTTTTGTTCC 240 

TTTTACTTGG TGGTTTAGCT CTCCTGTTTT CTCTTTTAGC TTTAACCAGC CATAAATGGT 300 

ATTACGTGAG ATTTGGAAAA CGTGTGATGC TTCTGTTATA CTACCTGTTC GCTCACAATA 360 

AGAGAGAACT TTTTTACGAA AATCTATTGA ATATGCCATA AGAAGATTAT ACCACATTGT 420 

GTACTATTTT TGGTTCATTT TACTATATTT CTAAACACTT AGAAATAATA AAACAAATTA 480 

AATATTATTT CTAAATATTT GAAAATAACA TCTATTTGTA TTATACTATC TTTGAGGTAA 540 

CTATTATGAA CTATATCAAA AGACCACATT ATTTAGATTT TTTAAGAAAA CATCGTGACC 600

GACCAATCAT CAAAGTTGTG AGTGGAGTTA GACGAGCTGG TAAATCTGTG CTTTTTCAAC 660

TCTATAAAGA GGAGTTACTA GCAACTGGGG TAGACGAGGA TCAGATTATA TTCATCAATT 720 

TCGAAGATTT GAGTTACTAT GATCTGCGAC ATTTTCAAAC ATTATTCGCT TATATAAAAG 780 

ATCAATTAGT TAGCAAGAAA ACATACTATA TCTTTTTAGA TGAAATTCAA TATGTTGAAA 840 

AATTTGAACT GGTAGCAGAT AGTCTATTCA TCTTAGCAAA TGTAGACCTC TATTTGACTG 900 

GATCTAACGC CTACTTTATG AGTAGCCAAT TAGCAACAAA CTTGACTGGT CGGTATGTTG 960

AGATAGAGGT TCTTCCTTTG TCATTTGAAG AATATCTATC AGGTCAATCT CTCACAGAGA 1020 

ATCTGAATAC AACAGAAATT TTTAACAATT ATCTCTTTAG TGCTTTCCCT TACTTATTGC 1080 

AAACATCATC TTACGATGAA AAAATTGACT ATCTCAGAGG AATATATAAC TCCATACTGT 1140 

TAAATGATAT TGTCACTAGA TTGGGAAAAC CAAATCCTAC TATTATTGAG CGCATTGTCC 1200 

GAACCCTTCT CAGTAGTACA GGTAGCTTAA TATCAACAAA TAAGATTCGC AATACCCTAG 1260 

TCAGCCAAAA TGTTTCAATA TCCCATAATA CTTTGGAAAA TTATTTGACA ACTTTGACAG 1320 

ATAGTTTACT TTTTTATTCC GTTCCACGTT TTGATGTAAA AGGTAGAGCA TTATTGCAAC 1380 

GTTTAGAAAA ATATTATCCC GTTGATTTAG GTTTACGACA TCTCTTATTA CCAGACCAGA 1440 

AAGAAGACAT TAGGCATATC TTGGAAAATA TGGTATATTT GGAATTGAGA CGTAGATATT 1500 

CACAAGTATA TGTTGGTAAT TTAGATAAGT ATGAGGTTGA TTTTGTTGTT GTAACTGATC 1560 

TTGGCCACTA CGCTTATTAT CAGGTCAGTG AAACAACACT TGCTCCAGAA ACACTAGAAA 1620 

GAGAACTTAG ACCACTAGAA GCCATTAAAG ATCAATTCCC TAAATATCTA TTAACAATGG 1680 

ATACGATTCA GCCAACAGCC AATTACAATG GAATCGAGAA GAAAAGCATT ATAGATTGGT 1740 

TACTAGAAAA ATAGATAAAT ATAAATCATA CAGCTAATTA GATTTGCAAC AGTCTGTTAT 1800 

CAATGATTCT ACCCAAATCC TAACAAGATA TAGTGAATTT CGAATACGCT ATATAATACG 1860 

GACACTTGAA AATAGAAATT GGGGATGAAA GGGGATCTAT AATTTCTGGA AGTACTATCA 1920 

AAAATTAATA TCATAGTCTT ATTAGAGAAT AGCATCACCC ACTTTCTCAA ATAAGATTAA 1980 

ATTGTAACTG AATTATAATG AAAAAGAGAC TGAGCAATCA GTCTTTAAAA TCAGAAAAGC 2040 

GCATAGTATC AGGTATTGAA CAACCTTGAT AATATGCGTT TTATTATGGA AATATTTGCT 2100 

TCATTTTCTC CTGAAATAGA GCTTTTGCTA TCCTATTTTT CTCTATTTCT AATGATTTAC 2160 

TTCAACTTCT TACCTCTTGG GAAAAA 2186 

(2) INFORMATION FOR SEQ ID NO: 180: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 3236 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 180:






tj 1 LAtJACGTT 


TGACTTCACG 


TATTTCATAA 


GTATAAACTT 


TATTTTTATC 


GGTTAGATAA 


60 


A l'C TTCATGC 


CATTTTTAGC 


ATTATCTAAA 


GGAGAAAATA 


ACATTTTATT 


AGCATTATCA 


120 


ACALC-AAAGA 


TATGGTGACT 


AGCTAGArTA 


TAATTTCCTT 


CTCCCATTAC 


TTGCTCGCGT 


180 


TTCATTGTAC 


CAGCTCCGTA 


GAAGAGATTA 


ACATTATCAA 


GTCCTTTAAA 


AATCGGCAAA 


240 


TTCATTTCCA 


ATTCAGGAAT 


TGCAATTCCC 


CCAATAACTG 


GTAATTTTTG 


AGCATCCCAT 


300 


TGAGAAGTTA 


GAACAGCTTC 


CGAAGAGATA 


GCTTTGACAG 


AATCAAAGTC 


AAAATTGCCT 


360 


TCTGTATCCT 


GATTTTCTTC 


TA ATTTTTPT 


TTTGATACCT 


GGCTAACTTG 


ATACTTATTG 


420 


GTATTCCAGA 


CTATGAAAAT 


ATTTPC A ATT 


TGAGTATTAA 


AAATCAAAGC 


CAGTGACAGT 


480 


AATATCAGAA 






AGCAGATTTT 


TTCGCTTGTT 


TTTCTTTTTA 


540 


TTATTTTTTT 


GAGACATTAT 


fiPTTPl POTT 


CTGTTTCGTT 


TTCTGTCCCA 


ACTTCTTCTT 


600 


TTTCTGCCAC 


CGCAACPfiTT 




CTATCTGAGC 


ATCTTGATCC 


AGGCGCATTA 


660 


CTTTAACTCC 


CATAGTTGCA 


CGTPPTGTTT 


GTGAAATATT 


GGCAAGATTG 


GTTCGAATCA 


720 


TGACACCTGT 


ATCAGTGATA 


ATCATCAAAT 


CCTCATCCCC 


TTGAACAGTC 


ATAAGACCGG 


780 


L.CAGCAAGCC 


ATTTTTTTCG 


GTAATTTTAG 


CTGTCTGCAT 


TCCCTTACCA 


CCACGACCTT 


840 


T w TV2T |f rr , /VT' a 

■L 1 V3 I 1 1 a 


TTCAGTAGCG 


ACTGTACGCT 


TACCATATCC 


TTTTTCTGTG 


ATAATAAGAA 


900 


CCTCATCTTG ATCAGTAATC AAGCTGGCAC CAACAACTGT GTCTCCTTCA CGAAGGTTAA 960

CACCTTTCAC ACCAGTGGCG ATACGGCTCA TACCACGAAC GGCTGATTGA TTAAAGCGAA 1020

CTGCATAACC AAACTTGGTA CCAATGATAA TATCCATATC TCCTTCTGCC AACAAGACAT 1080

TGATTAACTC ATCTTCATCC TTTAAATTCA GCGCTTTGAG ACCATTTTGA CGAATATTGG 1140

CAAACTCCTT AACACTGGTT CTCTTCACAA TACCGTGACG GGTTGTAAAG AAGAGATAAG 1200

CATCATCACT GCGATCAGAC TCAACATTGA TAACCGTCTG AATACTTTCG TCTTCATCCA 1260

ATTTCAAGAG ATTGACTACT GGTAGCCCTT TGGCAGTCCG ACCATACTCA GGAATTTCAT 1320

AACCTTTAAG ACGATAGACA CGTCCCTTGT TTGTGAAGAA GAGCAGATGA TCATGGGTGC 1380

TAGTTGACAC TAACTCACGA ACAAAGTCAT CATCTTTCAC TCCCGTTCCT TGGACACCAC 1440

GACCCCCACG TTTTTGAGCA GTGAACTCGT CCTGATCCAA ACGCTTAATG TAGCCTCTGT 1500

TAGAAAGGGT AATCAAGACA TCCGATTCTT CAATCAAGTC CTCATCCTCG AGACTCAAGA 1560

CCTGTCCAAT CATCAACTCT GTACGGCGCT TATCAGAAAA TTTACGTTTA ACTTCATCCA 1620

ATTCGTCTTT GATAATTTGA GAAACACGTT CAGGCTTAGC AAGAATATCT GCTAAATCCG 1680 

CAATCAGAGC CAAGAGGTCA TCATACTCAG ATTGAATCTT ATCGCGTTCC AAACCTGTCA 1740 

AACGACGAAG ACGCATATCA AGGATAGCTT GACTTTGACG TTCAGAAAGC TTAAACTTGC 1800 

TCATCAACTC AGCTTGAGcT TCCGCATcCG tTTCACTAGC ACGGATGATA CGAATCAyTC 1860 

X3TCGATATGG TCTAGCGCAA TCAAGAGACC TTCTAAGATA TGAGCGCGCG CTTCCGCTTT 1920 

TTCCTTATCA AAACGTGTAC GACGAACAAC CACTTCTTTT TGGTGCTCGA TATAAGCATC 1980 

CAAAATCTGA CGAAGAGAGA AAATTTTCGG TATACCATTT TGGATAGCGA GCATATTGAA 2040 

ACCAAAATTG GTTTGCATTT GGGTCATTTT GAAGAGGTTA TTGAGAATAA CATTGGCTGA 2100 

GGCGTCGCGC TTGACTTCAA TAACAAATCG AACACCTTCA CGGTTTGACT CATCACGTAC 2160 

TGCTGTGATA CCCTCAATGC GTTTTTCCTG AACCAAGCGA ACAATATGCT CATGCACCTT 2220 

GGTTTTATTG ACCATGTAAG GAAATTCTGT TACAACGATA CGCTCACGAC CAGTCTTAGT 2280 

CGTTTCAATC TCTGTACGAG AACGTAGGAC AATCGAACCT TTACCTGTTT CATAAGCCTT 2340

ATGGATACCT GATTTCCCCA TGACAAGAGC ACCAGTTGGA AAATCTGGTC CAGGCAAGAC 2400 

TTCCATCAAG TCCTTGGTAG TCACTTCAGG ATTATCCATG ACCAACTTCA CTGCATCAAT 2460

gGTTTCACCC AGATTATGAG GTGGAATATT GGTTGCCATC CCAACCGCGA TACCAGTTGC 2520 

TCCATTAACC AAAAGGTTTG GAAAACGCGC TGGCAAGACC AAGGGTTCCC GTTCATTGGC 2580 

ATCATAGTTA TCAACGAAAT CAACTGTATT TTTGTTGATA TCACGAAGCA TTTCCAGAGC 2640 

AATCTTGCTC ATACGTGCCT CGGTATAACG TTGAGCGGCA GCACTATCTC CATCCATGGA 2700 

ACCAAAATTC CCATGACCAT CTACAAGCAT GTAACGGTAG CTCCACCATT GAGCCATACG 2760 

GACCATGGCT TCATAAATAG AGGAATCCCC GTGTGGGTGA TATTTACCCA TGACATCCCC 2820 

TGTAATACGA GCAGATTTTT TATGGGGTTT GTCTGGGGTC ACACCCAATT CATTCATTCC 2880 

GTAGAGAATG CGACGGTGAA CAGGTTTTAA GCCATCTCGA ACATCAGGAA GAGCTCGCGC 2940 

TACGATAACA CTCATGGCGT AGTCGATAAA ACTTGCCTTC ATCTCCTTTG TCAGATTGAC 3000 

ATTCACTAAA TTTTTATCCT GCATTAATAA ATGCCTCATT TCACAATTAG TAAGTAACAA 3060

CATTATACCA TAAATTCCCA TCTATTTCAG CCTCTAAACC ACTAAAACGT TTACATCGAG 3120 

AACTATAAGG CATATTCGTG ACAAAGTTTT TTAAAAGTGA TAGAATGAAG TTGTCTAGGG 3180 

AAAACCCCTA ATAGAATAAG GAGATGGTTA nACAATGACT CTGACTAACA CACAAA 3236 
(2) INFORMATION FOR SEQ ID NO: 181:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 8651 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 181:



AGGTCCTGAA GTATTGGAAC AGGAAGGTCA AGAGTTTTTG GAACATTTCA AAAAACTCTT 60

GGAGTCAGTT GAAGTAGTAG CCATCTCAGG TAGTCTGCCA GCTGGCCTTC CAGTTGATTA 120

CTATGCGAGC TTGGTAGAAC TTGCTAATCA AGCTGGCAAG CATGTAGTCT TGGACTGCTC 180

AGGTGCAGCA CTTCAGGCTG TTCTTGAATC ACCCCATAAA CCAACAGTCA TCAAACCAAA 240

TAATGAAGAA TTGTCTCAGC TTCTTGGAAG AGAAGTTTCT GAGGATTTGG ATGAATTAAA 300

AGAAGTACTT CAAGAACCTT TGTTTGCAGG GATTGAATGG ATTATCGTTT CACTTGGTGC 360

CAACGGTACT TTTGCCAAAC ATGGTGACAC TTTCTACAAG GTAGATATTC CTAGAATTCA 420

GGTGGTAAAT CCTGTTGGAT CTGGAGACTC TACTGTGGCA GGAATTTCTT CAGGACTTCT 480

TCACAAAGAA TCGGATGCAG AATTACTCAT CAAGGCAAAT GTCCTTGGTA TGCTCAATGC 540

TCAAGAAAAA ATGACTGGTC ATGTCAACAT GGCCAACTAT CAAGCTCTAT ATGATCAATT 600

AATAGTAAAA GAGGTATAAA ATGGCTTTAA CAGAACAAAA ACGTGTACGC TTAGAAAAAC 660

TTTCTGATGA AAATGGTATC ATCTCAGCTC TTGCATTTGA CCAACGTGGT GCTTTGAAAC 720

GCCTCATGGT TAAACACCAA ACAGAAGAAC CAACTGTGGC CCAAATGGAA GAACTTAAAG 780

TCTTGGTAGC AGATGAATTG ACTAAATATG CTTCATCTAT GCTTCTTGAC CCTGAGTATG 840

GACTTCCAGC AACTAAAGCT CTTGATGAAA AAGCTGGTCT TCTCCTTGCT TATGAAAAAA 900

CAGGTTATGA CACAACAAGC ACAAAACGCT TGCCAGACTG CTTGGATGTT TGGTCTGCAA 960

AACGTATTAA AGAAGAAGGT GCAGATGCAG TTAAATTCTT GCTTTACTAT GATGTAGATA 1020

GCTCAGACGA ACTCAATCAA GAAAAACAAG CCTACATCGA ACGCATCGGT TCTGAGTG'J'G 1080

TGGCTGAAGA TATCCCATTC TTCCTTGAAA TCCTTGCTTA CGATGAAAAA ATTGCGGATG 1140

CAGGTTCTGT AGAATACGCT AAAGTAAAAC CACACAAAGT TATCGGCGCT ATGAAAGTCT 1200

TTTCAGACCC ACGCTTTAAC ATTGATGTTT TGAAAGTTGA AGTTCCTGTT AACATTAAAT 1260

ATGTTGAAGc kTCGCTGAAG GTGAAGTAGT TTATACACGT GAAGAAGCAG CAGCCTTCTT 1320

CAAAGCGCAA GATGAAGCAA CGAACTTGCC ATACATCTAC TTGAGTGCTG GTGTATCAGC 1380

TAAACTCTTC CAAGATACTC TTGTATTTGC TCATGAATCA GGTGCGAACT TTAACGGAGT 1440

TCTTTGTGGC CGTGCTACAT GGGCAGGATC AGTTGAAGCT TACATCAAAG ATGGTGAAGC 1500

AGCAGCTCGC GAATGGtCGC ACAACTGGAT TTGAAAACAT TGACGAACTC AACAAAGTTC 1560

TTCAAAGAAC AGCAACTTCA TGGAAAGAAC GCGTGTAAGA AAGTCCTCCT AGTTTAGGAA 1620 

CATGAATCTA AAAAAATTTA AAAAAAGTTG TATGTAAAGG CTTACAAAAT AACTTACTTG 1680 

TGCTATACTT AAATCACAAG TTAATATGAA TTAGAAAGTA ACTATATGAA GTATAATAAA 1740 

AATAGGATAT AGTTTATTTT ACGAGCTAGG AAGGAAAAAT ACGGAAACAA TATTGCCAGA 1800 

ATAAACTATA TTTAGATGCA CATTTCATTC ATTGTTTTAT AAAAGGAGAA GATAAACGGC 1860 

TACTAAAAAG AGTTTTAAAG CGTTAGTTGT AGGACTAGGT ATTGTTTCAA TATTCTTATC 1920 

AGCCTTACCT ATGGTTAGTG GTTCTGTATT TGCAGATAGT GCCCTAACTA CAGTAGATAA 1980 

AGCAAATGAT ATTGTTTTGA ATGTTGATGG GAATAAATTT TATAATGTTT CGGTTTCAGA 2040 

AGATATTGTA AATGCTGGTC AAATTTTGGA AGATTATTTT TATGTAGATA AATTTGGAAA 2100 

TATAAATTTA AAAGGCACTC CTGAAGAGTT AGCAAAAAAT ATTGGTATTT CTGTACAAGA 2160 

AGCAAGTTTG ATGTATGGAG CTGTAAAAGA GTTACCCAAC GTTTACGAAA GAGGTCCTGT 2220 

AGGTTTTCGT TTCAATCTTG GTCCTCAAGT GAGGGGGATG GGTGGCTGGG CTGCTGGAGC 2280 

TTTCGCTACT GGATATGCTG GATGGCATTT GAAACAATTT GCGGTTAATC CTGTTACATC 2340 

TGGATTTGTT GCTGTAATAA GTGGTGCGAT TGGCTGGGCT GTAAAAACTG CTGTAGAAAA 2400 

TTATTGGACA GTTGCTGTAG CTACAGTAGA AGTGCCGTTT GTGAACCTTG TTTACACCAT 2460 

AGATTTACCT TAGAGGTTAT TTCTTTATGA ATCATTCTTT TAAAAAAATA ACTGTATTTT 2520

GTTTTATAGT TTCTTGTGTT CTTTGTTTAT TAGACTTAAT GAATTTTAAA AATGTAGCTA 2580 

CTTTTTTATT TTTCTGTCTT CCTGTTTTTG TTTTGATTTA CAAAAATAAA TAAAAACAGA 2640 

GCCTCTGTTT GATGAATTTT AGAACATAGT TAAGTTTTAA AAAAAGTTGT ATGTAAAGGT 2700 

TTACAAAATA ACTTACTTGT GCTATACTTA AATCACAAGT TAATACAAGG TGAGTGTTAC 2760 

TAAGTAATAT TAGGCATGAT CACAGGTGAA TTAGAAATCA GCTGATTTTC TAGTTCATTT 2820 

GTGGTCATTT TTTGTACTTA TATACCTTTA AGATATAAAA GGAGGTTGAC ATGTATCGAA 2880 

TTCTAAATCC AATGAATCAC AATGTCTCGC TTGTCAGAAA TGATAAGGGA GAAGAGGTGA 2940 

TTGTAATTGG TAAGGGAATT GCATTCGGAA AGAAGAAGGG GGATTTGATT GCTGAAAATC 3000 

AGGTTGAGAA AATCTTTCGG ATGAAGACCG AAGAGTCCAG AGAAAACTTT ATGGCTCTTC 3060 

TCAAAGATGT TCCGCTTGAT TTTATCACAG TGACCTATGA AATCATTGAT AAGCTATCAA 3120 

AGAAATATCA TTATCCGATT CAAGAGTATC TCTATGTAAC CTTGACAGAT CATATTTACT 3180 

GTTCTTATCA AGCTCTAACT CAAGGAAGGT ACAAGGATAG TAATCTGCCA GATATTTCCG 3240

CTAAGTATCC TGTCGCTTTT CAAATCGCAA ATGAAGCTTT TGAAATTTAC CGTCAGAAGC 3300

TAGCAGATCA TTTTCCTGAG GACGAAATTA TTCGGATTGC TTATCATTTC ATTAATGCTG 3360

AAGGTGAAAA TGAAGTGGAA CTTGTGGAGT CGATTGATAA GAGGAAAGAA ATTCTCAGGA 3420

ATGTTGAAGA AGTTTTAACG GACTATGCAA TTCAACGAAC TAAAAAGAAT AACCATTTCT 3480

ATGATCGCTT TATGATCCAT TTGAATTATT TCTTGGATTA TTTAGACAGA TCTAGAGATG 3540

ATAACCAATC ACTTCTGGAT ATGGAAGATC ATATTAAACA ATCCTATCCA AAAGCCTTCG 3600

AGATTGGTTC CAAGATCTAT GATGTGATTA CGCAACATAC GGGTCTTGAT TTGTATAAAA 3660

GTGAACGAGT TTATCTAGTT CTACATATCC AACGTTTATT GTCATAAAAA TTTATTTAAA 3720

ACTATATAAG GAGAATTCTA TCATGAATAG AGAAGAAGTA ACATTGTTAG GTTTTGAAAT 3780

CGTAGCCTAT GCTGGCGATG CTCGTTCAAA ACTATTGGAA GCCTTGAAGG CTGCTGAAGC 3840

TGGTGATTTT GAAAAAGCGG ACGCTCTGGT AGAGGAAGCT GGTAGCTGTA TTGCAGAGGC 3900

TCACCACGCG CAAACAAGTC TATTGACTAA GGAAGCTTCA GGTGAGGACT TGGCTTATAG 3960

TGTAACCATG ATGCATGGCC AAGACCACTT AATGACAACT ATCTTGTTAA AAGATTTGAT 4020

GCATCATTTA ATTGAACTCT ACAAGAGAGG AGTTCAATAA TGAATAAACT AATTGCATTT 4080

ATCGAGAAAG GAAAGCCTTT CTTTGAAAAA CTATCTCGTA ATATCTATCT TCGTGCTATT 4140

CGTGATGGTT TCATTGCAGG TATGCCTGTT ATTCTCTTCT CAAGTATCTT TATCTTGATT 4200

GCCTTTGTAC CAAACTCATG GGGCTTTAAA TGGTCTGATG AAGTTGTAGC CTTTCTGATG 4260

AAACCTTATA GCTATTCTAT GGGTATTCTG GCTCTCTTGG TAGCTGGTAC AACAGCTAAG 4320

TCATTGACTG ACTCAGTAAA CCGGAGCATG GAAAAAACCA ATCAAATCAA GTATATGTCA 4380

ACATTGTTGG CAGCAATTGT TGGTTTGTTG ATGTTGGCAG CTGATCCTAT CGAAAGTGGT 4440

CTAGCTACTG GATTCTTGGG GACAAAAGGT TTGCTTTCAG CCTTCCTTGC TGCCTTTGTT 4500

ACTGTAGCCA TCTATAAGGT TTGTGTTAAG AACAACGTCA CTATTCGTAT GCCTGACGAA 4560

GTTCCACCAA ATATCTCACA AGTCTTTAAA GATGTGATTC CATTCACTCT ATCTGTTGTT 4620

TCTCTTTATG CTCTTGACTT ATTAGCACGT TATTTTGTTG GTTCTAGTGT GGCAGAATCA 4680

ATCGGTAAAT TCTTCGCACC ACTCTTCTCA GCAGCAGACG GATACCTTGG TATTACCATT 4740

ATCTTTGGTG CCTTTGCCTT CTTCTGGTTT GTTGGGATTC ATGGTCCATC TATCGTTGAA 4800

CCAGCTATCG CAGCTATTAC CTATGCCAAT GCCGAAGTTA ACTTGAACCT TCTCCAACAA 4860

GGGATGCATG CAGACAAGAT TCTTACTTCT GGTACACAAA TGTTTATCGT TACCATGGGT 4920

GGTACAGGTG CGACATTGGT CGTTCCATTT ATGTTCATGT GGTTGACAAA ATCGAAACGT 4980


AACCGTGCAA TCGGACGTGC TTCAGTAGTT CCTACCTTCT TCGGTGTAAA TGAACCAATC 5040

TTGTTTGGTG CACCTCTTGT TTTGAATCCA ATCTTCTTCA TTCCATTTAT CTTTGCTCCA 5100

ATTGCAAACG TATGGATTTT CAAATTCTTT ATTGAAACTC TTGGAATGAA CTCATTCACT 5160

GCTAATCTAC CATGGACAAC TCCAGCTCCA CTAGGTCTAG TTCTTGGAAC TAACTTCCAA 5220

GTGCTATCAT TCATTCTTGC TGCCCTTCTA ATCGTGGTTG ACGTTGTCAT TTACTATCCA 5280

TTCCTTAAGG TCTATGATGA ACAAATTCTT GAAGAAGAAC GTTCAGGTAA GTCTAATGAT 5340

GAATTGAAAG AAAAAGTTGC TGCAAACTTC AACACTGCAA AAGCGGATGC TATTCTTGAA 5400

AAAGCGGGTG TCGATGCAGC ACAAAATACC ATCACTGAAG AAACAAATGT CCTCGTTCTC 5460

TGTGCAGGTG GAGGAACAAG TGGTCTCCTT GCAAATGCTT TGAATAAGGC AGCAGCAGAA 5520

TACAATGTCC CTGTGAAAGC AGCAGCAGGC GGCTATGGTG CTCACCGTGA AATGTTACCA 5580

GAGTTTGATC TTGTTATCCT TGCCCCTCAA GTTGCTTCAA ACTTTGAAGA TATGAAAGCA 5640

GAAACAGATA AGCTCGGTAT TAAACTAGCG AAAACAGAAG GCGCTCAATA CATCAAATTA 5700

ACTCGTGATG GAAAAGGTGC TCTTGCATTC GTACAAGCGC AATTCGATTA AGGCTAGAGA 5760

CTCTGAAATA GTCTCCCATC GTTACGGAAA TCGCTATGGC GAATTTCCTA TTATTAATTC 5820

GTCGGTAAAA AGATATCGTT TTTACCTCCT CATGTCACAA TTCGGTGACT TGGTACAAGA 5880

AGTGAGATGG AGAAGGATGG CTCACTGACT CCTCTCCTCT CACTTTTACT TTATTTAAAT 5940

CAAGAAATAG GTGAAAAAAA TGACAAAAAC ACTTCCAAAA GACTTTATTT TTGGTGGCGC 6000

AACAGCTGCT TATCAAGCAG AAGGTGCTAC ACATACTGAT GGAAAAGGAC CAGTTGCTTG 6060

GGATAAATAT CTTGAGGATA ACTACTGGTA CACTGCCGAA CCAGCTAGTG ATTTTTACAA 6120

TCGATATCCA GTTGACCTCA AGCTAGCAGA AGAGTATGGT GTCAATGGTA TTCGAATTTC 6180

TATTGCTTGG TCACGTATTT TCCCGACTGG TTACGGCCAA GTAAATGCTA AAGGTGTTGA 6240

GTTTTATCAT AATTTATTTG CAGAGTGTCA CAAACGTCAT GTTGAGCCTT TTGTAACTCT 6300

TCATCACTTT GACACGCCAG AAGCTCTCCA CTCAAATGGA GACTTCTTAA ACCGTGAAAA 6360

TATCGAACAT TTTGTAGACT ACGCTGCCTT CTGTTTTGAA GAATTTCCAG AAGTAAACTA 6420

TTGGACAACC TTTAATGAAA TTGGACCAAT CGGTGATGGT CAATATTTGG TTGGGAAATT 6480

CCCTCCAGGT ATCCAGTACG ACCTTGCCAA AGTCTTTCAA TCACACCACA ATATGATGGT 6540

GTCTCATGCA CGCGCGGTAA AATTGTACAA AGAGAAAGGC TATAAAGGGG AAATTGGTGT 6600

TGTTCACGCC CTGCCAACTA AATATCCTCT AGATCCTGAA AATCCAGCAG ATGTTCGTGC 6660

AGCTGAGTTG GAAGATATCA TCCACAATAA ATTCATCTTA GACGCAACTT ATCTAGGTCG 6720

CTATTCAGCT GAAACCATGG AAGGTGTCAA CCATATCTTA TTAGTCAATG GTGGTAGTTT 6780

GGATCTTCGT GAAGAAGATT TTACAGCATT AGAAGCTGCA AAAGACTTGA ATGATTTCCT 6840

AGGAATCAAC TACTATATGA GTGACTGGAT GGAAGCCTTT GATGGAGAAA CTGAAATTAT 6900 

CCATAATGGT AAAGGTGAAA AAGGAAGCTC TAAGTATCAA ATCAAAGGTG TTGGTCGTCG 6960 

TGTAGCTCCT GACTATGTAC CACGCACGGA TTGGGATTGG ATTATCTACC CTCAAGGTTT 7020 

GTATGACCAA ATCATGCGTG TGAAGAAAGA TTATCCTAAC TACAAGAAGA TTTACATCAC 7080 

TGAAAATGGT CTCGGCTATA AAGATGAGTT CGTTGATAAC ACTGTTTACG ATGATGGTCG 7140 

TATTGATTAC GTGAAGCAAC ACTTGGAGGT TTTATCTGAT GCGATTGCAG ATGGAGCTAA 7200 

TGTAAAAGGT TACTTCATTT GGTCATTAAT GGATGTCTTC TCATGGTCAA ACGGTTATGA 7260 

GAAACGTTAT GGTCTCTTCT ACGTAGATTT TGAAACTCAA GAACGTTATC CTAAGAAATC 7320 

AGCTCACTGG TACAAGAAAG TAGCGGAAAC TCAGATTATA GACTAGTAGA ATTAGTCATT 7380 

AGATATAGAA TTTTAGTGAG TCAAAAAGAT GTTCAAAGAT TTTATCCAAT CTATTTATGA 7440 

AAAAAAGTTT ATATTATAAA TTTCGAAAAA TGCTCTCAAA TACCGTGTTT GACGAGTGAA 7500 

GAATTGAAAA GTCTTGGAAA ATGGTATGTC TCGACTGGTA AAGAATGGAT TTGTCATTCA 7560 

GATGATGAGC TGGAAGAATT TAAAAATCTA TTTTTAAATT TTATCAATCC TGAAGAATGG 7620 

GATACTATCT CCTTTGATTC AGATTTTATG CCGTTTCAAC AATCGTAACC AATTTCTCAA 7680 

AAAAGTTAAA TCTTATATTT AGTACTCTGT AAAACTCTTA TCTAATCACG TTGCTTATAC 7740 

TCAATGAAAA TCAAAGAGCA ACTTTAAACT AGGAAGCGAG TCGCAGATTT CTCAATGCAT 7800 

AGCTTTGAGG AATTGGGCAA AAAGTCTTTG ATATAGAAAA ACGCATAGTA TCAGGTGTTT 7860 

CAACACCTGA TACTATGCGT TTTATTGTGG GAAGATTTAC TTTTTTTCTT CTGAAATTGA 7920 

GTTGTTACCC AGGCTCTTTC AGTTTATTAA GGCTTGATGA CTTTAATGTG TTTAGATAGC 7980 

TTAAAAAGGA TTGAATCACT TAGTTTAGAA TCTGAAACAA TAGTATCAAG ATTTGATACA 8040 

TTATAAAAAG TATAAAAATC AAACTTATTG AACTTGCTAT GATCTGCGAG TAAATATTTT 8100 

TTATTAGAAT TATTTAAAGC GATGCGTTGA GCCTCTCCCT CTTCCTCGCT AAAAGTAGCT 8160 

AGAGCTCCGT TTTGAATACC ATTACAGCTA ACGAAAGCTT TAGAAAATTG GAGATTAGAG 8220 

AGATTTTGTA GGGTCAATGT ACCAACAAAA GCACCTGTAA TATCGCGATA ATTTCCACCT 8280 

ATTAAAATCA AATCTGTTAA TTTTCGTTCG CTTAAAATCA GAAAAACAGG TAGACTGTTG 8340 

GTTACGACGC GGATATTGTC AATAGGCAAC TCACGCGCAA AAAACTCTAA TGTTGTTCCT 8400 

GGTCCAATGA AAATAGTTTC TCTTTCTTCT ACTAGACTGC CTGCAAAATG GGCTATTTCT 8460 

TGTTTTTCTG CCGTTTGGAG GGCTTGTTTT TCAATATTTG ATCGCTCATT AGTCAAAAGG 8520 

GAGTTGGTTC GAAGTTTTTC AGCTCCACCA TGCACACGAA TCAGCAAATC TTTATCAGCT 8580

AATTCCTGTA AATAGCGCCT TGCAGTCATA TCTGAAACGG CTATTTCGTC CATAATCTGT 8640

TTAACTGTTA T 8651
(2) INFORMATION FOR SEQ ID NO: 182: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3786 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 182: 



AATCTCCAAT CAGTGCCACT TCAGCTACAA AGAAGAGGAG GATAATAACT CCGTTCACAA 60

GGACAGACAA GAATAATTGA TAGAAGGAGT CGGTTTCACT TGCTTGACTT GGTCTTGTAA 120

TGATwTGGAG ACTGGCAAGC AGAATGATTC CAATGCTAAT CACACACAAG AGGGCTGTAA 180

ATCGTAGGCT ATCAAAGAAA GCAAAGAAAC TAGCAATAGC AGTGAGGAmG ATTGGAATTG 240

CCAAGAGTTG ACTATATTGT TGGAGAACCT TGTCTAGCGT CCAGTCCTTT TCCTGGTGGA 300

TAAATCGTCT CACAACGAAA CTACCCAAGA GGAATGAAAA GAAGAAGAGT GTTGTCGCTA 360

CTAGGATAGA GATGATAGAA AAAAGAGTTA AAGGAGCTAG CTGCTCAGGG AAGCGACTGT 420

TAATGCTTGC TATATGTCCA TAGTAAGCAT GTTTGATGTG ATAGATACTA AAGAAAAAGG 480

AAGATGCAGA AAACAGAATG AGCAAGAGAA AGGCTGTGTA ACTGTGTGTG ATACTTGTTT 540

CCAACTTACT TGTAGGAGAT TTGATCGCTT CCACTAGCCA AGACCAAAAA TCAAGCACTT 600

GCTCTTTCCA TTTATCCCTA GATTTTGGAG CTTGGTCGGG GATATAAGGA CTTTCTAAAG 660

ATTTACTGAT AAGAAGTGGC TCTTTCGTGG TTGCTTTTTG CTGAGGAAGA GCTTCTTGGC 720

TCTCTTCAGC TATAGTGACT TTTTCTGTTT CTTTAGAAAG GTCTGGCTCT TCTTCAGTAG 780

AATTAGATGC CTTCTTTTCT TCTATTTCTG TTCTCGCTTC ACTGTCTTCA GGAGCTTCAA 840

TTTTCTCTTC TTGCTGGCTT TCCAATTCGA CTTCAGCTTG AGGGACTTCC TCCTCTAACT 900

GAGTATTTTT TTCAATTGGT GTATCGAGAT CGGCTATCGT TTCTTCAGCC TTGTCTGCAA 960

CCTCTTGAGC TTGCTCTTCA GGCTTGTTCT TGCTTGTTGT TTTTACAAAA TCATTACTTT 1020

CAAACCATTC TTGTTTCATG GTAGAACCTC CTTTTTAGTT AGATAAATAT GTTTCCATAG 1080

TAGCAAATGT AAGCGTTTTT GTCAACGTCT GCTTGGTGTG GATATTAGAT CAATATTATC 1140

ATCAGATCTC GCAATGAGTT GATCCTTGAC ATCGGTTTTT TCAGTTTTGT AAGGGTTGCT 1200


TAATTCCGTA CCTCTTGATT CAGGCTTTTC TCTTGTGAAT TGGAAGATAG AACCATAGTT 1260

laH IXjAGATG TCCCAGTTAA TTCGTTGGCT TTCTTTCTGG TCTAGGATGA TTCTGAGATA 1320

ATCTTTGGCA GTCAGTTCAA CCTTGCCATG GACTTGGATA TTTTCAGCGT GGAAGTGATT 1380

CTCTGTTGAC TCTAGCTGAC TATCTGTAAG AACTGTATCA AAGATATTAA CGATATTGGG 1440

CGTTGTGAGT TTACTGTTTT TGATACGACT TCCTTCAATT CGGAGGATAT AGCTGTTTGT 1500

ATTGAGGGTC GCATTTTCAA GGCTAGCATT TATGATGGTG GTTTGTCCGC GATTGGCTGA 1560

GATGTTGATC CCTTTTAGAG TTCTCCCTTT TGGTAGTCGG AGAATAACTT CTTCAAAACG 1620

ACTAGAGTAG CTACTTGCGA TATGAAGAAT CCCACCAATT CCAGAAGAGA GAAACGGAGT 1680

TTCAGACAGT TTCTTATCAG TGAGACTCAG AGTTCTATCG TTCTGATTGG TGATAAGATC 1740

ATGGTGAGCA GAAAGAGATG GATGGTAAGA AATGTGGATT TGATCATCGA AAGAGTCTGT 1800

GATGGTGAGC GTGTGTTGGT GGAGAGTAAT TTCTAGGTTT TCGACTTCCT TGCCAAAGGT 1860

TAGCTTTTCC GTACGGCTAT CATAGACAGG TTCTTTGGAC ATGGAAAGTA GGCTCTTAAt 1920

CCCGTCAGAT TGGATACCTA CAAAAAGCAG GATAAAGCCG ATAACGGTAG TCACCACACC 1980

AAAGATGAGA AATCCTTTTG TCCATTTACG CATGCTGATT ACCTCTCTTT CCTTTTTTAA 2040

GAACAAATTG TACCAGACGA ACAATGAGTA GACCGAAGAA GCGAGTTGCA TAGGAAATGC 2100

CAAGTAAAAC TAGCGAAGAA GCACCGATAG CCAGTAAACC AGAACCAAAA ATCAAGATAA 2160

AGGCTGATTT GGCTTGGGCG AGGACAGTGA AACTTTCAAC TAAAAATAGG AATCCGCCGA 2220

TGATACCCAG TATGGAAACT GCAAAGAAAG CCAGAATGAC AGTCAAAGCG GCTACAAGAA 2280

TTGCGAACAG GGTCACGAGG ATGGCGATTC CCAGAGGAAT GCCGATAGGT GCTGCAAGGA 2340

GGGCTAACAA GGCGATATGT AAAATTTGTC GGTTATTTTT TTGAGCGGGT GCTTCATTGA 2400

TTTTTTTATC GAGAAGATTG GATAGAACTT CGTGGGCCGC TTCTTTGGGA GTTCCCAAAC 2460

TAGCGATGAG TTCTTCTTCT CCTTCGACTC CAGCATCGTC AAAGAGCTCT CTGAAATAGT 2520

CCATGGCTTC GATACGGTCA GCTTCAGGTA GTTTCTTGAG ATAGAGTTCT AGCTGAGTCA 2580

GGTATTCAGT TCTTGTCATG GCGGATACTC CCTTCTATGA TGCCATTGAT GGTGTCTGTA 2640

TAGAGTGCCC ATTCATCTTT TAGGGTCAAG AGCTGCTCTA TACCACCGTT TGTCAAGGAG 2700

TAGTATTTGC GCATGCGACC TTGGAACTCT CTAGAATAGG TTGTCAGAAA GCTATTGCCT 2760

TCCAATTTTT TGAGAATGGG ATAGAGTGTG GATTCTTTGA TATTAGCGAT CAGCTTAATG 2820

GTTTGGCTAA TCTCATAACC ATAAGAATCA CCCTGCTCCA GTACAGCCAA GATGAGAAAT 2880

TCAATCAAGG CAGAGGATGT TGGAAAGTAC ATGGGAAACC TCCTTTTCTA ATGTGTAAGA 2940

TTTTTATATA TAATTTTTCT ACACATACAT TGTACATCTA AAAGAAAGCC CTGTCAAGAG 3000

AAATGTGTAA AATTTTTATA TATAAAAAAC TTCTAGCTAA AACTAGAAGT TTAAAGGATC 3060

TTATCCGCTC TGTCCACTGT AAAGAGGGCC ACAGTCATCA GGATATCGAT GAGCAAGAGG 3120

GCAGCTACAG ATGGTACCCA AGAGTGGAAC AGGTCAAAAC TGTAACCAAA GAGGGTTGGC 3180

CCAAAGGCTG CTAGGATATA GCCTCCTGTT TGAGATAGGC CGGACAATTG GGCTGTCTTT 3240 

TCAGGGGCGC TTGTCTTGAG TGAAAAGTTG ACCATGAGAT AAGGGAAGAG GGCACTGGTT 3300 

GCGGTTCCGA TGAGGAGATG GATGGCAAGC CAGTAAATGA AATTATTGAT TGGGAAAAAG 3360

AGCATGGAAA TGCCGACCAC ACCAGCTAGT GAAACCAGAG TGAGCATGAG CTGACGGTTG 3420

CGAGTAGATA AACTGGTTGT CAGGCTTGGG ATGGTCATTG AAAAAGGAAT GCTAATCAGA 3480 

GATAAGATAG AAGTCAGCAA GCCAGCTTCG TGACTGGATA GACCTGCATG GATAGACATG 3540 

GTAGGTAACC AGGTCATGAC GGTGTAAAAG ATCAAGGATT GAAAACCTGA AAAGATAATA 3600 

ATTGCCCAAA CCTGTTTATT ACGCATGACC TTTATTTGAC TTTTTTGTTT GGTTTGTGGA 3660 

GCTAGTCTAT GATTATAGCG GTGATTTGGG AGCCAGACCA AAAAAGTTGC TAGACAGAGT 3720 

AACGTGAGGA GAAGGATAAG TCCTTTCCAA GAACTGGCTT GTGTAATGGG CACAGCTAGA 3780 

TAGGAA 3786
(2) INFORMATION FOR SEQ ID NO: 183: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3054 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 183: 

TCAGCTAAAA AACATTGCTA AATTGATTGA AGCTGGTGCT ACACATTCCG ATTCAACTTC 60 

TCACACGGCG ACCACCAAGA ACAAGGTGAG CGTATGGCAA CTGTTAAACT TGCGGAAAAA 120 

ATTGCAGGTA AAAAAGTTGG TTTCCTTCTT GATACAAAAG GACCTGAAAT CCGTACAGAA 180 

TTGTTCGAAG GTGAAGCTAA AGAATATTCA TACAAAACTG GTGAAAAAAT TCGTGTTGCA 240 

ACTAAACAAG GAATCAAATC AACTCGTGAA GTGATTGCGT TGAACGTTGC TGGTGCTCTT 300 

GATATCTATG ATGATGTTGA AGTTGGTCGT CAAGTTTTGG TTGACGATGG TAAACTTGGT 360

CTTCGTGTGG TTGCTAAAGA TGATGCAACT CGTGAATTTG AAGTTGAAGT TGAAAACGAT 420 

GGTATCATCG CTAAACAAAA AGGTGTGAAC ATCCCTAACA CTAAAATTCC TTTCCCAGCT 480 

CTTGCTGAAC GCGATAACGA CGATATCCGT TTCGGTCTTG AACAAGGTAT CAACTTCATC 540 

GCAATTTCAT TCGTACGTAC TGCAAAAGAT GTGAACGAAG TTCGTGCAAT CTGTGAAGAA 600

ACTGGAAACG GACATGTTCA ATTGTTCGCT AAAATCGAAA ACCAACAAGG TATCGATAAC 660

TTAGATGAAA TCATCGAAGC AGCTGATGGT ATTATGATTG CTCGTGGTGA TATGGGTATC 720

GAAGTACCGT TCGAAATGGT TCCAGTTTAT CAAAAAATGA TTATCAAGAA AGTCAATGCT 780

GCAGGTAAAG TTGTTATCAC TGCAACAAAC ATGCTTGAAA CAATGACTGA AAAACCACGT 840

GCAACTCGTT CAGAAGTATC AGATGTATTC AACGCTGTTA TCGACGGAAC TGACGCTACA 900

ATGTTGTCAG GCGAGTCTGC AAACGGTAAA TACCCACTCG AGTCAGTAAC TACAATGGCT 960

ACAATCGACA AGAACGCTCA AGCTCTTCTT AATGAATACG GACGTCTTGA TTCAGATTCA 1020

TTTGAGCGTA ACTCTAAGAC AGAAGTAATG GCTTCTGCTG TTAAAGATGC TACTAGCTCA 1080

ATGGATATCA AATTGGTTGT AACTCTTACT AAGACAGGTC ATACTGCACG TTTGATTTCT 1140

AAATACCGTC CAAATGCTGA CATCTTAGCA TTGACATTTG ACGAATTGAC AGAACGTGGC 1200

TTGATGTTGA ACTGGGGTGT TATCCCAATG TTGACAGATG CTCCATCTTC AACTGACGAT 1260

ATGTTCGAAA TCGCTGAACG TAAAGCGGTA GAAGCAGGTC TCGTTGAGTC AGGCGATGAT 1320

ATCGTTATCG TTGCTGGTGT GCCAGTAGGA GAAGCTGTTC GCACAAACAC AATGCGTATC 1380

CGCACAGTAC GTTAAGAAAA ATATAAAAAC CTATCATATC CAGCTTTAGA GCTTGTGTGA 1440

TAGGCTTTTT GTATAGAGGG TAAGAAATAG GCAAAACTTT CATAATGGAT TGATACTCTT 1500

CGAAAATCTC TTCAAACCAC GTCAGCGTCG CCTTACCGTA TATATGTTAC TgACTTCGTC 1560

AGTTCTATCT ACAACCTCAA AGCAGTGCTT TGAGCAACtG CGGCTAGCTT CCTAGTTTGC 1620

TCTTTGATTT TCATTGAGTA TGAAATAAGA TATGCACAAA TTGATTAGAA AGTCAAATGA 1680

ATTTCTACAA ATGTTTTAGC AATCGTAATG TACTTGTCTA GATTCGATCT GATATATTTT 1740

CGATTTAATG ATATGGTATT TAAAACCTCC AAAGTAGCTT ACTCCATTCT TTTACTTACG 1800

TGAGTGTAGA TGTTATTTAC TGTTTTAGCG TTTTTGTGTT CCACTCTAAC CATTATAGCA 1860

TTCTTCTCAG CTAGTGTACT AAGGAGTGTG TGCCTGAAAA TATGGGAACT AAGGGGCTGG 1920


TTTATCGGTT 


TCTCTAGTTT 


AGTATTTGCC 


TTTTGCAAAG 


TGATCTTAAA 


TGCCTTTCTC 


1980 


TAAATTTACA 


TATCACTATT 


GTTTAACAAA 


ATCTAATCTA 


TTTTAGGTCA 


CTTATTCTTT 


2040 


TTTTGAAATG 


TAGAATGAAC 


TTTTTCAAAG 


TTTTTCGAAT 


CTTTTAAAAT 


CTGTTTGCTT 


2100 


TATATCGCCA 


TTCTCCCCCC 


TTTTTTAATT 


CTCCCTATAT 


AGCCTGACAG 


CTTTCCCGAT 


2160 


GGTACGAATA 


TGGTTGCTTT 


CGTCTAGGTG 


GATGTCGGGG 


TATTCGGGAT 


TGAGTTTTTT 


2220 


TGAGGCAGCC 


TTGGCGGAGT 


TTCTTGACAT 


AGTTAGTGCC 


GTCTACTTGG 


AAGATGCCGA 


2280 


TGGTATTATA 


GTCAATCTGT 


GGGGTATTCT 


TGATAAATAG 


GTAGTCGCTG 


TTTCTTATCT 


2340 


TTGGCTCCAT 


GGACTTGCTG 


ACGACATAAG 


CGATTGGGTC 


GTAGTCGTCT 


GGGATAATGG 


2400 
AAACTCCATA TCTAAATCGT TGTCCTGCAT CGAGCGGCTA CCTGCAGAGA TAAACTACCT 2460

AACACGAGAG TAAGTAGTCT GTCTGTAGTC GTCCAGTCTG ATGATTTTTA CGATACTTCG 2520 

TTTTTCTGAT CATACAGTTG CCTCTCGGCA TAGGTCAGAA CTTTACCTTG TCTGGGTGGT 2580 

TCCCGTTGGT CGTAGATAGA TTGGATATCG CTAGGAGAAT CCTTTTGAAC TGGAGGAAAG 2640 

AGGGCATCGA TCAAGCTACT GAATACTTTA ACTAAGTCAA ATATAGTATT TTTCTTAGTA 2700 

GACCTAACCC TTTTTTCATA ATTTCTAATG GTGTTTTTAC TTATACCTAT CTTAGTACCC 2760

AATTCTTATT GAGTCCAACC ATTACTAGTC TATATTGTTT TATAGTTGAT TGAGTTTGGA 2820 

ATAGTACGCT GTAGCTGCTA AAACATTTCT AGAAATTAAT TTGACTTTCC TAATAGAGTT 2880 

GTTCATATCT TATTTCAATC TATTATGTTT TTCACCTCTA ACAATCGCAA TCTCTTCTTT 2940 

ATCCATGAAT GAAATCGCTT TCTATTTTTG TAAGTAAAGC ATAACACGAA ATCCACGAAA 3000 

ATGAAAACCT TTGTTGTGTT TTCGTAAAAA ATTTGTTGAC AGAGCACGAA ACGC 3054 



(2) INFORMATION FOR SEQ ID NO: 184: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1590 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 184: 

TGTGATTTTC yGAAAATTTG GTAAAATATA TCTTAATCAT TTTCAGGAGG ACAAAAATTT 60 

GACAAGATAT CAGAATTTAG TAAATGGAAA ATGGAAATCA TCTGAACAAG AAATTACGAT 120 

TTATTCACCA ATCAATCAAG AAGAATTGGG TACAGTTCCA GCCATGACTC AGACTGAAGC 180 

TGATGAGGCT ATGCAAGCTG CGCGTGCAGC CCTGCCAGCA TGGCGAGCTT TATCAGCAGT 240 

TGAACGTGCG GCTTATTTGC ATAAAACAGC AGCTATTTTA GAACGCGATA AGGAAGAAAT 300 

TGGTACTATC CTTGCCAAAG AAGTAGCAAA AGGGATTAAA GCAGCAATTG GAGAAGTAGT 360 

GCGTACAGCA GACTTGATTC GTTATGCTGC TGAGGAAGGT CTCCGTATCA CTGGACAAGC 420 

AATGGAAGGT GGTGGTTTTG AGGCAACAAG TAAAAACAAA CTGGCTGTTG TCCGTCGTGA 480 

ACCAGTTGGT ATCGTGCTAG CGATTGCTCC CTTTAATTAT CCAGTTAATT TATCTGCTTC 540 

TAAAATTGCA CCTGCCTTGA TTGCAGGGAA TGTGGTCATG TTTAAGCCAC CAACACAAGG 600 

TTCCATTTCT GGACTCTTGT TGGCTAAAGC ATTTGAAGAA GCAGGGATTC CGGCAGGTGT 660 

TTTCAACACC ATTACAGGTC GTGGTTCAGA AATTGGGGAT TATATCATTG AGCACAAAGA 720

AGTCAACTTC ATCAACTTTA CAGGTTCAAC TCCTATTGGA GAACGTATTG GTCGTTTAGC 780

TGGTATGCGT CCTATCATGT TGGAACTTGG TGGGAAAGAT GCAGCTCTTG TACTAGAAGA 840 

TGCAGATTTG GAACATGCTG CCAAGCAAAT TGTTGCGGGA GCCTTTAGCT ACTCAGGACA 900 

ACGTTGCACG GCCATTAAAC GTGTCATTGT TCTCGAAAGT GTAGCAGATA AATTAGCTAC 960 

TTTGCTTCAG GAAGAAGTTT CTAAATTAAC AGTTGGTGAT CCATTTGACA ATGCTGATAT 1020 

TACACCTGTT ATTGACAATG CTTCAGCCGA CTTCATTTGG GGCTTGATTG AGGATGCACA 1080 

AGAAAAAGAA GCTCAGGCTC TTACACCAAT CAAACGTGAG GGCAATCTTC TCTGGCCAGT 1140 

GCTTTTTGAC CAAGTTACAA AAGATATGAA AGTGGCATGG GAAGAGCCAT TTGGTCCTGT 1200 

TTTACCAATC ATTCGTGTGG CTAGTGTAGA GGAAGCTATT GCCTTTGCCA ACGAATCTGA 1260 

ATTCGGCCTT CAATCATCAG TCTTTACAAA TGATTTCAAA AAAGCCTTTG AAATTGCTGA 1320 

AAAACTTGAA GTAGGTACAG TCCACATTAA TAATAAAACC CAGCGTGGTC CAGATAATTT 1380 

CCCATTCCTT GGTGTCAAAG GTTCTGGAGC TGGAGTGCAA GGAATTAAAT ATAGCATTGA 1440 

AGCGATGACA AATGTCAAAT CCATTGTTTT TGATGTGAAA TAACGTGTAA AACCAGGAAA 1500 

TTGTTTTCCT GGTTTTATTT TTTTGCTATA AAATAATAAT AATTATAGAA AAAATACGAA 1560 

CTTTTTGGTA TTATAATAGA TTGAAACCGG 1590 
(2) INFORMATION FOR SEQ ID NO: 185: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4848 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 185: 

CCTGCAGTTG TCAGACCTGT AATTTTCTTT TTATCTGTAA TAAGAATCGT TCCAGCGCCT 60 

AGAAAACCCA CACCTGATAT AACTTGAGCT CCTAATCGTG TAGGATCTCC TGTCCCAAAT 120 

TTATAAGATA CGTATTCATT CGTCATCATA ATCAAACATG CAGCTAGACA AACAATACTA 180 

TAAGTTCGGA TGCCTGCAGG CTGGGATTTG CTCCCTCTCT CTAAACCAAT TATACTACCA 240 

ATGACTACTG ATAAAACAAT CCTGACAACT ATTTCAATAT TTGATAACCC AAGACTAGTG 300 

GCTGTCATGA TTATTTCCTT ACTTTACGCC CCGGTCTTTG TGTGAAGTAT AATACCGTTC 360 

CAGAAATAAT CATCAGAACA ATTGTATAAA CAAATACCAG AGCTTGTGCA TTAGATGTTG 420 

CTGTTTCATC ACCTGCAGAT CGAATCGTAA TACCTAATGG TTGAGCTAGG GGATGGTAAA 480 

GGAATACAGA TAAGTCGAAG TCAGTTAATA AAGAGTTAAA GTTTAAAGCA ATAACAGAGA 540 
GAACAACCGG TAAAATAAAT GGAATGATAA CCTTCATCAT AGTATAAAAA GGTGAAGCAC 600
CCATACTTCT TGCTGCATCT TCCATCTCAT CATCAACACT AAATAAAATA GCACGTACCA 660 
TTCTATAAGA AAATGGGATT TTTACAACTA TATATGCAAT AAGTAGAATT ACCAAACTAC 720 
CTACCAAAAT CTGATTCAAG ACAAGAAATT GTGGCTGATT AAAAGTAAAT AATAAACTTA 780 
CTGCTAAAAG TGTACTTGGT AGTAACCAAG GAAGTAGAGC ACCATATTCA AATAAGAAAT 840 
CAAAACGAGA TTTATGTTTT CTGACAACAC GAGCAAATAC AACTGCGAGA ATTGTTGCTG 900 
TTGTCGCAGC AATAATAGAA TAAATAAAGC TGACCAAGAA TGGAGAGAAT GCCGCACTAT 960 

TACTAAAGAA TAAGCGATAA TTTTCTAAAG TAAAGTTTGA TAATGTTAAG TTACCTGTTT 1020 

GAATTGCAAC TGGATCTGTA AATGAGTATA ATACTATAAA AATTAGTGGA AGCATGAAAA 1080 

CTGTGAACAA TCCATATGCT ACAATGTGAG CAATGATATT CCAAGGCTTA GACGCAATTT 1140 

TTTGTTTTTT AAGAGGCGCT TTAGTCTTAG AGATAGAAAT ATAATTTCCA CCTTTTTCTA 1200 

TCTTATTCAT GATAGTAAGC AAAATTGTAG TTGCAATACC TAAAATAATT GCAAGTAGGG 1260 

CAGCTAAATC ACGAGAATTC CCCATCCCTG CAAATGTAAT AATCATTGGA TTTATAGTTT 1320 

GAAATTCTTT ACCACCAACA ATCATGGGTG CTGCTACTGC AGATAAACCA CTAAGAAAAA 1380 

CCATAATAGT AAGTGCAAAT AGAGTTGGAA TTAAGGTTGG TAACACTACT TTTCGGAAAA 1440

CAGTAAATGG TTTTGCTCCC ATATTTCGAG CAGCCTCAAT AGTGTGATAG TCAACGCTTC 1500 

GAATTGTATT TGTTAAAAAC AATGTATGAT TAGCAGTTCC TGAAAATGTC ATAATGAATA 1560 

AGACTGCACC ATACCCAATA AACCAGTTAG GGTCTAAAGA AGGGATAACA TTTTGTAAAA 1620 

ATTTTGTAAT CAATCCATAA GGACCATAGA CAAATTTATA TCCAGTCGCT AAAACCACTC 1680 

CTCCATAAAT TAAAGAGGTC ATATAACCTA ATTTTAAAAT TTTAGCACCT TTAATATCAA 1740 

AGTACTCTGT AAATAGAACA CAAAGAATAC CTACGACATT AACTGTAATA ATGAGTGAAA 1800 

ATGCTAACTT AAAACTGTTC ATAATACTCT GAAGTGCCCT CTGAGATTTT AGAACACGAT 1860 

GTACAGCATC AAGGGAAAAT TCTCCTCCTT TTACAAATAC ATTCACTACT AGATCAAAGT 1920 

TTGGATAAAT AATAAATGTT ACTAAGAACC AGATTAACCC TAAACGAATA AGCCAATCTT 1980 

TTAAATTTAA TTTATGACGC ATACTGCACC TCCTTAAAAT TGCAGAACGT CTGATGGTGT 2040 

GATAAATAAT TCCACACTTT CTCCGACAGA TCTAATAGCA GCCTGACTAT CAATACTTGT 2100 

TACATTAAGA ATCTGACTTT CAGAAACTTT TATTGTATAG TGAATTGTAA CTCCAGAAAA 2160 

CTCAACATCA ATAATTGTCC CTTTTAGAAT AAAATCTTGT TCAGTTTCAC GATTGAATCG 2220 

AACTTTCTCT AATCGAATGT ATCCTTTTTT ATCCTCTAAG AAAACGCTTG TATTTTTCAA 2280

TAATACTTCG TGGACTGTTT CATCGGTCAA AACATTAATA TCTCCAATAA AATCACATAC 2340

AAATTCAGTT TGAGAATTAT GATAAATCTC TACTGGTGTA CCGACCTGTT CGATGTATCC 2400 

ATTGTTAAAG ACTGCAATTC TATCAGATAA AGTCAAGGCT TCCTCTTGAT CATGAGTAAC 2460 

ATATAAAGTA GTAATACCTA ACTCTTTTTG AAGTCTTTTC AACTCTTTTC TCAAATCTAC 2520 

ACGTAATTTT GCGTCAAGGT TTGACAATGG TTCATCTAGA CAAAGAATTT TAGGTTCAAG 2580 

AACCAGAGCA CGAGCCAATG CTACCCTTTG TTGTTGACCC CCAGATAATT CTGATACATT 2640 

ACGCTGTAAC TGTTGATCAG AGATCTTAAT TTTTGCTGCC ACTGCTGATA CTTTAGCTTT 2700 

AATAACATCT GGAGCTACCT TCTTAACTTT TAAACCAAAT GCAATATTAT CAAAAACAGT 2760 

CATAGTTGGA AATAGCGCAT AAGATTGAAA TACAATACCA ATTCCACGCT TTTCAGGTTC 2820 

CAAATGAGTG ACATCTGTTC CATTAACTTC AATACTTCCT GATGATGGAT CTAGAAAACC 2880 

TACCAATGCT CTCAAAGTAG TTGATTTACC ACATCCTGAA GGCCCAAGAA ATGTAAAAAA 2940 

TTCCCCTTCA TGTATATCTA AATTCAGATT ATCAATTGCA ACAAAATCAC CATATTTAAT 3000 

TTGAATATTA TCAAATTTAA TCATCTCACT AACTCCCTCT ATTACTAAAC CAAAAGCCTC 3060 

TCTTTATTTC TTCCATAAAT TTAGAAATAA TAGAGAGACT TGGACATAAA AATTAACTCT 3120 

TATTTCTTAT TGTACGTATT CTAATTCAGC TTTTTCTACC CATTCATCCA AATGCTTTCC 3180 

AACAGCTTCC CAGTCAATAT TTTGTGGTTT CACTTGATCA ACAAATTTCT TCGTATCTTC 3240 

AGGTAGATCT TTGAGGGCAT CTTTATTTGC AGGAATAGAT CCAAAGTTCT TACTATATTC 3300 

TACTTGAATT TCTGATTGAC CAAACCAATC AATAAATTCT TTAGCTAACG CTTGTTTTTT 3360 

ACTAGTGCTT AAAACCATAG TTTGTTCAGT TACAAATGGT ACACCAATCT CAGGAGTCAT 3420 

AACTTTGAAA ACAACATTTT GTTCTTTTTG TCCAACTAAT GCACCAGAAC CCCACATCAT 3480 

TCCATATTGT ATTGGATCTT CTTTGTCTAA CATCTTAACA ATTGAACTTT CTCCCTTTTG 3540 

AAGAGTGTAT GCATTTTTCA AATATTCTTT TGCTACTTCC CAACCTTTTT CGGAAACACC 3600 

TAATTCACCT TTATCATCAA GGTATCGAAC TAAGATACTT GCTAGAATTG CCCGTCCTGT 3660 

ACCTCCTTGA AGACCAGAAA TTGAATATTT ACCTTTATAC TTACTACCTA ATTCAGTCCA 3720 

ATCTTTAGGC ATTTCTTTTA CATCAGGCGC CCCAATTAAA ACTAATGGTT GAACAATCAC 3780 

AGGATTATAA TAATTATCTT TATCTGATAA AGATTGATCA ATTTTATCTA ACCATTTAGG 3840 

CTTGTACTGT ACTAGTAATT TTTGATCTCT AATTTTATTT GAATCAACAG CACCAATTCC 3900 

AAATACCATA TCTGCAACTG CATTATTCTT CTCAGCAATA ACACGGTCTG CTAATTGAGC 3960 

GCCAGCGATA TCAACCATTT TTATATTAAA ACCAGCTTCT TTTGCTTTAG CAGTTAACCA 4020 

ATCACCACGA CCATTTGAGA CTGAGTTCGA ATAGATAACT AATTCTTGAC TTTTATCAGC 4080 
TTTTTCTTCA GATGAAGAAG CAGTCGTAGA ATTTGAACCT CCAGAGCAAG CAGCAAGTGT 4140

AGTAAgAGCA ACTCCCGTTG CAAGTACAGT AGACCAAACT TTCATTTTTT TCATGATAAG 4200 

TTCTCCTTTT TTATTATTTT ATTTAAATTT TTCGTGATAT GGAACAAATT GTCTCATATC 4260 

TTCAAATACA GTATAGTCAA TACGGTTTAC AGTAATAGTT GGAATCTTCT CTAATAAAAT 4320

TTCAGTTAAT TCTGCTCTGA CTTTAGTAAA CTCTTCTTCC TCCTCTTCGG TTAGAGGAAT 4380

CCGAAGATAC CCAATTGAAA TATGGAATTG ATATCTATCA TGATTAGGGA AACAAACACC 4440

TGCTTTTTCT GAGACATAAG TACGAATTTC TTCTAATCTC TTTGCAGAAG CTTCATCTGC 4500 

AGGTTCAACT AGTATGTTTT GTTTTCCCAT TTCAGTTATA CGCATATGAA TTTCTTCATC 4560 

CAACAATGGA AAAATTTCAA GTTGTTTAGC AAAGTAATCA TGTATTTCCT GTAAAGGTGT 4620 

ATCTAGAGGA AGATTACTGC TCCAAAACTC gtTCACGATT TTCATGGCAC AACAATTCAA 4680 

TTACAGTCAT GTGAATAGAA TTCCTTGGAG TTAAAGTAAA CTTATCGATA AATGGTAATT 4740 

CTCTATAACG TGATTGAATA ATATCAACAA CTTCCATCAA ATCTTGTTTA GTATAAAGAT 4800 

TTGCTACAAC TGTATTCCCA GGGAAATGAT TAAATTCCCC ATTCTCGG 4848 



(2) INFORMATION FOR SEQ ID NO: 186: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3763 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 186: 

GTTATAAGCA ACACCTTCTT GCTTGCCATA AGTTGTGAAA TGGGTAGAAT CGATATCTAC 60 

AATGAGTTGG TTTAGCTGGT GAAACTGTAA AAAGAATTCG ACCAATTCAA GGTTGAGGCA 120

TCGCAAACTA TGGACTGTTT CCTCGTCAGT TCTGGAAAGA AAACGGGATA AGGTTGGCTG 180 

TGAAGCAAGC TGCCCTCCTT CCAATAATTT TGGAAAGTAG GCATCAGCTG ACAATTCTTT 240 

ACAAGCATAG TCCGTTCCAT AACCTGTTAA CAGTTGAAAG AGGAACTGGA CAAGGATATC 300 

TGAATCCGAA TAACGACAGT AGCGGCGTTG GTCATTCGTT ACTAAATACT TAGAAATCCG 360 

CTCTTTTAGT TTCAACTGGG AAAAAAGTTC CTGAAAAAAG ATAAGACCAC CATACTGGGT 420 

TAAATGACCT CCATCGAAAG ATAGTTGGTA AAAAGACTTG TTTTGGAAGT GATGATTTGG 480 

TAAACTGTTC ATGTGAGTTT CCTTTCTTTT TGTGTTTTTT TCTACACTTA TACCATAAAG 540 

GGGAAACTCT TTTTTGTCTA GTAAAAAACA CCCATTGGGT GAAAAAAGAA ACCATCCAGG 600 
ATCTAAGCTA AGGCAAGGAT TCTGGATGGT TTTTAGATTT GGGGTGAATA ATTGGGGATT 660

TAGGAGAAAT GATGGTATCT TCCAAATCAA AATCAACTTC ACTCCATAGT CTCAACTGAT 720 

TGATTTTCCC ATCTTGATAG GTCACATCCT TGTCAAGGAT AAACTGAGTC AACACCTCAT 780 

GTTGACCTTG ACACCTGATG TCATCTACCA AGAGCCAGAC ATCCTCTACC AACATGAGGA 840 

TTTTTCTCCT GTGAAGATAA GGCAAATCAG GTTCTGCTGA CCAATAAGCC CCCTCAATAT 900 

AATGCACTCC CTCCCTTTCT TTATGGTGAC AAAACAGGGA GTGAGGATAG TATTCATATT 960 

CCCAGGATCC CGTGATTCTT TCCGGAGCTT TCCCATCTAC AATGCAGGTC GAATGACTCC 1020 

AAGCACTCTT TAAGAGATAA CGTTCATATA TCTCCCGATA AGAATAACGC CCAGCATCTA 1080 

TGAAAATAGG TTGGCCTTGA TACTGTAAGC AAAAACTATT CTCGTCACTA TGACTATGGG 1140 

CACTTCCTAG CGGACCATTT TTGAAAAATA GATAACGATG TTCATCCTTA ATGCAGACAT 1200 

GTCCAGAGTC TTCAAAGATC ATGGACTTAG GCTGCCAAGC TCTCTTTTCA AATTCCTGCA 1260 

GTCGCTTGAC CTTTTCTCGC CCCAGGAACA AGAGGCTAAG CAAATCAACT TTAACATCCA 1320 

GACCGTTAAG AAGGTCTTCC TGGTTCAAAA CCACAGCAGA CAGGCTCAAA ATTTCTGTCG 1380 

TTTCTGTAGA ATCGCTATCA CCAAAAGCCA AAGTCCGTCC ATCTAAGCCT GTCATCATTT 1440 

GAATATAGGT CGCCATCTTT TCCAGCAACT CTTGGTAACT ATCTTGCAAG TCTGGAAGCA 1500 

AGAGACACAA ATCCAGCAAG GCTTTATAAA CCTCTACATG ATAGAGAATC GACTGTTCAA 1560 

ACTGGCTTCC ATCTCCTAAA ATCTGTGTCT CAATTTGCTG TTTCAACTCC TCTGAAGCAA 1620 

AATGGTAAGC TTCTTCTAGA TCCATCTTAT CTGAAAAGAA ATGATAGATA GCAAGCATCG 1680 

GAATTGTTTG TAAAATCCCC CAGTTACTAA GGGTGTACTT GGCGCGATAG TAGCTTTTCA 1740 

TAAAGTCAAT CTGCTTTTCT AGACTGACCA AAATTTTCTC TAGTTCTTTC TCCTCTAGCA 1800 

AGTCAAATTT CAAGAGGAGC AAGAGTAGTT TCAACCAAGT AAAGGAACGA ATACCCGTAT 1860 

CCAAGGTTCT AGTCATCAAG GATTGAGGAG AAAATTCTCT CACCTGCTCA ATCCAATCAA 1920 

ATAGAAAGAA CTTGCACTTT TGAATATAGT CCTTATCTCC TTCTACCAGA TACCCTATCA 1980 

TAAACTGCAA GAGATATTCT TGTCGATTGA GCATATAAGA CCATTCTGGA TCATCTTCAA 2040 

ATACTTGATC CCATACCATC GGCTGGATTT GATGGATTTT TGAACAAGGC TCCATATCCC 2100 

AAGGACTATC AAACATAAAA CGATTGTCCA TCAAGCGTTC AAGGGAACTC TTGACTTTCT 2160 

CATAGTCTTT TGAACAGTGC GACAAGATAT AATCACGACA TTGATTTCCA TCGACTCTTT 2220 

CAAAAAATTG TCTTCTTTCT TCTTTCATTA TCTATTACCA GAAAAAGAAC TACTTAAAAA 2280 

GCAGTTCTTT TGTCTTTCCC ATTACACTTT CCTTTTCTAC ATGGATGACC ACACCTTTTG 2340 

CAATCTGCAA GGAGACCAAG TCATCTTGGA TAGAAATGAT TTTTCCATGA ATTCCAGACA 2400 
ATAACAACAC TTCATCACCA AATGTTAAAG AAGCTAAATA CTCTTGTCGT TGCTCCATCT 2460

GTTTGCGAAG CAACTTTTGC TGACGAATAG AATGAAAGCT TGACAGTAAA AGGGGACTCA 2520 

CTGCCAAGAC AATCACTATT CCATAAAACA ATGTTGTATC CATTAAGCTA TAATCTTAAG 2580 

CCAGCTTCCG ATAATTCCGA TGATAACTGT TAAAATAACG AGTTTATATG TTGTCCATTT 2640 

CTTTTCTTTG ATCAAGTAGT AAACTAAAAG TGTAAATAGG GCTGGTAGAA GAGCTGGAGC 2700 

AACCTTATCA AGCATTCCCT GAATACTTAC GATACTTTGT TTAGCGTCTG CTTTAACTTC 2760

CCCTGCAGCA AAGGTAATCG GCACCATAAT CTTAACAGAT GTCGCTGCCA AACCAGCAAT 2820 

TACGtTACAC CGATAATATT GGCAATACGA GAAATCGTTG CCATCTGTTC GCTTAGTTTA 2880 

TCAATCACAG TTGTTCCTAG TTTGTATCCA TACAGACCAG TTGACAATTT AATCGCTGTT 2940 

AAAATCGTAT TCATCGCAAG GAAGAACAAG ATTGGACCGA CAACCAAGCC TTCTTGAGCA 3000 

AACGAAGCTG CGATGGTTGA GAACAATGGA GCTAAACAGA ATTGAGAAAG AGAATCCCCA 3060 

ATACCTGCCA ATGGTCCCAT CAAGGCCATC TTGATGCTAC GTGTTTCTTT TGCCGGACGG 3120 

CCATTTTCCA ACATTACAAG ATGCAAGCTG GTAATAAAAG GCAGGAAGTG TGGGTTGGTA 3180 

TTATAGAATT CACAGTTTTC TTCCAAGGCT TGGTAGAAAC CTTCCTGATC CTCTCCATAG 3240 

TGTTTTTTCA AAGCAGGATA CATCACATTG GCATATCCCA ACCCTTGATA GTTACTATAG 3300

TTAAATCCAT TTTGACAAAA GAATGCCCGC AAAGACGTTT TAAGATAATC ACGTTTTGTT 3360 

AATTTGTTAG ATCCAGTCAT CGTGTGCTTC CTCCTCTACC ACATGATCCG CTGTTTTTGG 3420

CTTGTTATAA AATTCAATCA AAGCAAAGAT AGTACCTACA ATTGCAATAC CAATTGTTGG 3480 

GATGTTTAGA TAAGCTGCAC AAACATATCC CAACAAGACA AAGGGAATCA ACTCTTTCTT 3540 

AGCCATCACT GACAAGATCA TCGCAAAACC GATAGCTGGG AGCATTTTAC CAGCAACTGT 3600 

CAAACCTGTA AGTAATACCG GTGGAATGTA GTCTACGAGT TTCAACAAGG TATCCATTGA 3660 

AAGGGCACCA AGCAACCCAA GGTAAATCCA ATAAAGGCAA ACAACCAAAT TGTTGCATTT 3720 

AGAGTGAACT TAAATTTCTT CAAATTATGG TTTTTCAAGT GCT 3763 
(2) INFORMATION FOR SEQ ID NO: 187: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5053 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 187:

[illegible] [illegible] TCAATACTAw CAAAGGGAAT yCCTGACGTC AAGTAATGTT 60

[illegible] [illegible] GCAACCACTC CATCAACTTT ATTATGACGC AACATCTCCA 120

[illegible] CTCTCTATTT GTACCATTGA TAGAACATAA GAGTAATTTG TTATTTCTCT 180

[illegible] ATTTTCCACA TGCATAGCAA ATTCTGAAAA GAAGGGATGC CAGATACTTG 240

[illegible] TGCAATCGTT TCTGTTCGAT TTTTTTTCAT TCCTCTAGCG TAGTAATCTG 300

[illegible] CAAAGTTTTA ATCGCTTGTT CCACTTTTTT CAAAGTTACT TCTTTAATGC 360

[illegible] ATTAATTACA CGTGAAACAG TTCCAACACT AACTCCTGCT TCTAAAGCAA 420

CATCTTTCAT GGTAATTGAT TTTCTTTGTT CTACCATATT ATCACCTCCT TTCAATATAT 480

AGTATCATGC AAATGCTTTT TAAGCAACTA TTTCTCAATC ATTTTTGGCC AGATCATTTA 540

TCCCATCATG AATAAAATCA CTCCAATTAG CTTTTGAAAA TACTTCAATT TTCATGTGTA 600

AACATCTACA TAAAACAGGA AAAGCCTTGG TTTCATGGCT TTTTTCGTAT CTTCTATAAA 660

AAAAGCAAGA GTTTTAGATG GCTATAAATC TAGATGTACA TTTTGCTTAA ATGATTGAAG 720

GTCTTTTCTT AACAAAAACA CCCCCAAAAT TAGACTTTTT CTGTCTAACT TTTGAGGTAC 780

AGTTCAAACG CGAAATAGCG TTTTTTTGTT ATTTTTGGTT ACTCATCTAA TCGAATAAAC 840

ATCATGGCAT TTAACAAGTA TATGAGTGAG ACCGTGTTTA TATTATTTGA ATAGATGAGT 900

CTCTTATTTT CAATAGGAGG AATAATAAAA TTAGAAATAA TGATATCATA AGGTGAATCT 960

TCTAAAGATT CCTTTGATAA TTCTAATTCA GTCCAAACTT CCAGTTCAAA ATTATTGCTA 1020

[illegible] AAAGTGTCTC TGCAACGAAT TTTGCATGAT ACTGATCAAA ATTACTCATA 1080

[illegible] TTAGTTTAGG CTGATTTTGT AGCAAATTAA TCACCAAATG TTTGGTATGA 1140

[illegible] TATAAGATAG ATGATTTACC ATCATTGAAC TAGAACAAAC CTCAAGAGTC 1200

[illegible] GAGAAAGCTC TTTTTTTATA TCTGAAACAA ATTTTGGAAA AATATTTTGA 1260

[illegible] TTGTATTCCC TTTTTGATCA AATAAAATAA ACTCAGTAAA CAACTCTTGA 1320

CGATACAGAT GTGCGGTATT ATGCAGATGC CAAATCAGAT TATCCTTATT CTCCATTTCA 1380

ATCTGATACT TGACTGAAAT CTGATCAATA AAATCACTCA ATAGATGGTA AGATTTTTCA 1440

ACATAACTAT CCTTTTTTAC GCATTTCATA AAGAGACTTT CATCTATGAA AAACATTTTT 1500

TGAAAGTAAG ACACAAATAA TTGGCAAACA ACTTCTTCAT CTAAAGAGAT ATTGTATTCT 1560

GATTCAAAAC TCTGAGCAAC ACCTTCTATT CCTTCTGCCT GCATTAAAAA ATCCAAACTT 1620

TGGTCGTTAA AAGAATCTTT ATCTACTTCC ATAAAATGAC CAAACTTTAT TCTATATAGG 1680

TTCGTAACTA GGAGCAACTT TAGCATTCTA TGCGTTGACA AATTCATTGG AAAGCTTGTT 1740

TCCTTATAAA CCAATTCTAA CAATTGAGAT AGTGGCTCTG ATGAAAAATT TTCAAATGGC 1800

CATTCTAGGA AATAATATTT TTCTGAAAAA TATTGTGCAA AAAAGTAACG AATGTCTCTC 1860

TCATTTCCAA TGATTTGAAC AGGGGTCAGA CTAACTTCAA ATTGAAATTG CCTTTTAATC 1920 

ACTTTATTGA TTTGGCTAAT AATACGATAG AGCGAAGATG AACTGATATA AAATTCTTTA 1980 

CAAATACTCT CAGCTTGACA ACCTTCATTA AAGAAGATGA ATTCTAAAAT CGAAAAATGA 2040 

GTTGAATGTT TAAAGAAATG ATGGTAAACC ATTTCAATAT CACTATCATC GGTATTAATA 2100 

ATGCGTATAC CATTAGTAGA AGAATGAAAA ATCAAGTCAG GAAAAGCAGA TTTAACATGG 2160

GATAGATCAT CTTTGACTGC ACGTTCTGTA CAATTTAATA ACTCTGCTAG TTCAGAACGA 2220 

TGAAACCAAC GTTTATGTTC AAATAATAAT TCTAATAATT CTAATTGCCT ATGACTTTTT 2280 

TTAGATAATA AATCTCTCAT GAATATCTTT CTCTCTTTAT AAATTATCGG ATTAAACCTC 2340 

TTGCAATTAT ACCACAAAGA ATAGGTATAG CATGATATAA CGACTTTTCC TAAAATCTTT 2400 

TATTTCGTAT AATAACACTA CGGAGACAAT ATATAAACAA TTTTCTTATT TTACCGTCTA 2460 

TTGAGGGCGT GAATACAGAA TCAAATTCAA GTCTAAAGAT TATATTTTTA ATTTTAAAAA 2520 

TTATATAATA GCAACAATTA AAGAATTTGA TTTTTTAAAA TTATATAATA ATAACAATCG 2580 

AAATAATTGA CTTTTCTATA TTAAAGTTAT ATAATAGTAA TAATCAAAGA AATTGATTTT 2640 

TTGATATTAA AATAAAAAAG GAGGGTAGGC AGTGTTGTGA TCAATTATTG CTGGAGGTCT 2700 

TATTGGTCTC TTGGCAGGTA AAATCACTAA AAAAGTAGTT CTATGGGAAT CATCGCAAAT 2760 

GTATTCGCTG GTTTAGTCGG GGCATATGCA GGACAATCTC TTTTAGGTAG TTGGGGTCCA 2820 

GCAATCGCTG GAATGGCTTT GCTCCCATCT ATTGTAGGTG CAGCGATTGT GATTACTGTA 2880 

GTGTCATTCT TTACAGGTAG AAAGTAAACT TTTCGCCAGT AAAGTTAGCA AACTATTTTT 2940 

AAATCAATGA CGGGAAAAAT AGTTTAAATG TTAAATCGAA AGGATTGTAT ATGTCAAAAG 3000 

CAAAGAAAAT ATGTTTCATT ATTTTCTGTA TTTTAATCTT GACAATTTTC CTTCCTGTTT 3060 

TGATAGATTA TCATCAAGTT AGTGATCTAG GTATTCATCT ACTTAGCTGG AGACAGAACT 3120 

CCGTAGTTGA ATTCTATCTT GCTAGATATG TCTTTTGGGG GACAGTGGTT CTATCAACTT 3180 

TAGTTTTATT ATCCATTTTA GTTGTGATGT TTTATCCTAA ACGTTACTTG GAAATCCAAC 3240 

TTGAAACTAA AAACGATACA TTAAAATTAA AGAATTCGGC AATCGAAGGT TTTGTTAGAA 3300 

GTTTGGTGAG TGATCATAGA TTGATCAAGA ACCCAACTGT TCATGTAAAT TTACGAAAAA 3360 

ATAAATGTTT CGTTCATGTA GAAGGTAAAA TTCTTCCTTC AGACAACATC GCTGACAGAT 3420 

GCCAAATAAT TCAAAATGAA ATAACTAATG GATTGAAGCA GTTTTTTGGT ATTGAGCGTC 3480 

AAGTAAAACT TGAAGTTGCA GTAAAAAATT ACCAACCAAA ACCTCAAAAC AAAAAGACTG 3540 
TTAGTCGTGT GAAGTAAGGA AGTAAAAAAT GGAATGGCTT AAACAATATC GATATCCAAT 3600

TATCGCTGGT CTCATAGGCG TATTTCTGGC TTGTTTGATT GTCTCCTTTG GCTTCTTCAA 3660 

AACAATATTT GTATTGATTT TAGGAGCACT GGGAGTTGCA GCTGGATTAT ATATCGAAAA 3720 

AAACTATATA GATAAATAAA AAAATAAAAA TTACTAATTT AATTAAAGGA GTTTCATATG 3780 

TCAAACGAAA AAAACACAAA CACTAACGTA GAAAAGAAAG ATGCTACTGT TGTAGCTCAC 3840 

GAAATCAAAG GGGAACTTAC TTACGAAGAT AAAGTTATCC AAAAAATCAT TGGTCTTTCA 3900 

CTAGAAAACG TTTCAGGTCT TTTGGGAATC GATGGTGGTT TCTTCTCAAA TCTTAAAGAA 3960 

AAAATCGTTA ACAGCGATGA CGTAACAAGT GGTGTTAACG TAGAAGTTGG TAAAACACAA 4020 

GTTGCAGTTG ACTTAAACGT TATTGTTGAG TACCAAAAAA ATGTTCCAGC TTTATATTCA 4080 

GAAATCAGAG AAATCGTATC TTCAGAAGTT GCTAAAATGA CTGACTTGGA AATTGTTGAA 4140 

ATCAACGTAA ACGTTGTCGA CATCAAAACT AAAGAACAGC ATGAAGCAGA CTCAGTAAGC 4200 

CTTCAAGATC GCGTATCTGA CGTTGCTGAA TCAACAGGAG AATTCACTTC AGAACAATTC 4260 

GAAAAAGCTA AATCTGGTCT TGGATCTGGT TTCTCAACTG TTCAAGAAAA AGTTAGCGAA 4320 

GGTGTAGAAG CTGTTAAAGG TGCAGCAAAT GGTGTAGTAT CTCACGAAAA CACTCGTGTA 4380 

AACTAAGATA AAATAAATAT AACAGGAGAA ATTATCATGT CAGTAGAAGA AAAATTAAAT 4440 

CAAGCTAAAG GTTCTATTAA AGAAGGTGTT GGGAAAGCCA TCGGTGATGA AAAAATGGAA 4500 

AAAGAAGGTG CAGCTGAAAA AGTTGTTTCT AAAGTAAAAG AAGTTGCCGA AGACGCTAAA 4560 

GACGCTGTAG AAGGTGCTGT AGAAGGTGTT AAAAACATGT TGAGTGGCGA CGATAAATAA 4620 

GGTTAAAAGT TACTTTATCT TTTTAGTAAT ATTAGTCAAA AGAGTCTGAG TCAAGATGAT 4680 

TCTCAGAAAA CAAAAAGCTA GAGATTCCCA ATTGCGGAAC TCTAGCTTTT TAATTTTGCC 4740 

TCTTTCTCTT ATTATATTTC AGCAGGTTGT TGGCCATGAG TACGAATCCC ATGTCAATTC 4800 

TCACTTGACG CTTACCTCTC AGATGACATC TCTTATAACC CAAACAAACC TTTATCTGCC 4860 

CAAAGACAGA TTTCATATCA ATCTTACGTT TAGCGAAAAT TTGTCTACCC TTGGAAGATA 4920 

AAAGTGCCTG ATATTCTTTA GTTTTTAAAC ACTGGTAACG TTCATTCATA TACAGTCTCT 4980 

TTTGAGGGGC TGATTCAGGT TCATAATCGC AGTCAACATT GATTTCAAGG CTGTTTGCTT 5040 

TCTATCTCCC CGG 5053 
(2) INFORMATION FOR SEQ ID NO: 188: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6492 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 188:

AATTCTCTTT TTTCCAACAA AATGTATGAC CTGCACTTGA ATACTTCTCA TTGTTTGTAC 60

ATTCATCTAC TTTCATATAA TCTTTTACAA AATCATAATA TGACATAACA CACTATCCCT 120

TTTAGACAAT ATTCCAATTA GCCTTATTAA TTCAAAACTA TTGTATTAGT AATTATAACA 180

GATGTATAAT AGAAAAGCAA TGATAGATAT TATCAATTAA GCGAATTTAT ATCTAAAAGG 240

GATATTAAAG AAAGGAGATA TGCTTATGAA GATTTACAAA AAACTATTTG CTTATGTCCA 300

AGATAAGAAA TATCTTGGGG TTTTGGCCAT AATTTTTTCT GCTATATCTG CTGCACTTAC 360

AGTATATGGA TATTATTTAA TCTACAAATT TCTAGATAAG TTAATAATTA ATTCAAACTT 420

ATCCGGTGCA GAGAGTATAG CATTAAAATC TGTTATTACA CTAACAAGTG GAGCGATATT 480

TTATTTTGTC TCAGGAATGT TTTCACATAT CTTGGGATTC AGGCTTGAAA CAAATTTAAG 540

AAAAAGGGaA TCGATGGTCT GGAAAAAGCA AGTTTTAGGT TCTTTGACTT AAATCCATCT 600

GGTCAAATAA GAAAGATTAT AGATGPCAAT GCTGCACAAA CTCATCAGGT GGTAGCACAC 660

ATGATTCCCG ATAGTTCTCA GGCAATAATC ACACCCGTAC TTGTACTTGC ACTTGGCTTT 720

ATAGTAAGTA TAAGAGTTGG CATAATTTTG CTTGCTCTTA CTATAATTGG TGGCTTAATT 780

TTAGGGGCAA TGATGGGCGA GCAAGAATTT ATGAAGATAT ACCAAGAATC CCTATCTAAA 840

CTAAGTGCTG AAACTGTTGA GTACGTGAGA GGAATGCAAG TTGTAAAAAT ATTTAAAGCA 900

AATGTAGAGT CTTTTAAAAG CTTTTATAAG GCGATAAAAG ATTACTCAAA GTATGCTTAT 960

GATTATTCCC TATCTTGTAA AAGGCCTTAT GTTTTGTATC AATGGTTATT TTTTGGACTG 1020

ATTGCAATTT TAATTATTCC TATAGTTTAT TTTATGACTA GCTTAGCTAG CGCAAAGGTG 1080

ATTTTACTiG AGCTTATCAT ^aTTTTATTT TTATCAGGAG TTCTCTTTGT TTCATTCATG 1140

AGAATGATGT GtACTCCATG TATATTTCTC AAGGAAATTA TGCAGTAGAT ACTTTAGAGG 1200

CGCTTTACGA AGATATGCAA AAAGACAAAT TAGTGCATGG TAATGTCAAT AATTTTAAAA 1260

ACTATAATAT AGAATTTGAG AATGTTAGCT TTGCTTATAA TGATAAAGCT GTCATTGAAA 1320

ATTTATCCTT TAATTTAGAA GAAGGAAAGT CCTACGCACT TGTCGGTTCA TCTGGATCAG 1380

GCAAATCAAC AGTAGCAAAA CTTATATCAG GTTTTTACAA TGTTAATAAA GGAAGCATAA 1440

AGATAGGCGG GATAGCAATA AGTGAATATT CTGACGAAGC CTTAATTAAA GCCATTTCCT 1500

TTGTTTTTCA AGATTCAAAA TTATTCAAGA AGAGCATTTA TGATAATGTA GCGTTAGCTA 1560

ATAAAGATGC GACGAAAGAT GACGTTATGA GAGCCTTAAA ATTAGCAGGA TGCGATTTAA 1620

TATTAGACAA ATTCCCAGAA AGAGAAAATA CAATCATAGG CTCAAAAGGT GTTTATTTAT 1680

CCGGTGGAGA AAAACAAAGA ATTGCAATTG CTAGAGCAAT TTTAAAGGAT TCCAAAATTA 1740 

TTATTATGGA TGAAGCATCA GCATCTATTG ACCCAGATAA CGAGTTTGAA TTGCAAAAAG 1800 

CTTTTAAAAA TCTTATGAAG GATAAAACAG TTATCATGAT TGCACACAGG CTATCTACAA 1860

TTAAAGACCT TGATGAAATT ATTGTCATGG ATAGTGGAAA AATTATAGAA AGAGGGTCTG 1920 

ACAAAGAATT AATGTCAAAA GATACAAGGT ATAAGAGCCT GCAAGAGATG TTTAACAGTG 1980 

CGAATGAATG GAGGGTTTCA AATGAAAGAG TTTTATAAAA AAAGATTTGC TCTTACAGAT 2040 

GGAGGAGCAA GAAATTTAAG TAAAGCAACA CTGGCTTCAT TTTTCGTTTA TTGTATAAAC 2100 

ATGCTTCCTG CCATATTACT TATGATTTTT GCTCAGGAAG TTTTGGAAAA TATGGGCAAA 2160 

AGCAATGGCT TTTATATAGT ATTCTCAGTT TTGATTTTGA TAGCAATGTA TATTTTGCTT 2220 

TCTATCGAAT ACGATAAATT ATATAACACA ACCTATCAAG AAAGTGCAGA TTTAAGAATA 2280 

AGGACAGCGG AGAATTTATC AAAATTACCT CTATCTTACT TTTCTAAACA TGACATTTCC 2340 

GACATTTCAC AAACAATCAT GGCTGATATT GAAGGCATAG AGCATGCAAT GAGCCACTCA 2400 

ATACCAAAGG TGGGCGGCAT GGTACTGTTT TTCCCATTAA TATCTGTAAT GATGCTAGCG 2460 

GGCAATGTCA AGATGGGTTT AGCTGTAATT ATTCCATCTA TTTTAAGCTT TATATTTATA 2520 

CCTTTATCTA AAAAATATCA GGTTAATGGA CAGAATAGAT ATTATGATGT CTTAAGAAAA 2580 

AACTCAGAAA GCTTTCAAGA AAATATCGAA ATGCAAATGG AGATTAAAGC ATATAATTTA 2640 

TCGAAGGATA TTAAAGATGA CTTATATAAA AAAATGGAAG ATAGTGAGAA AGTACACTTA 2700 

AAGGCGGAAG TAACTACAAT TTTAACTTTG TCTATATCTT CAATATTTAG CTTTATATCT 2760 

CTTGCTGTTG TGATATTTGT CGGCGTAAAT CTAATTATTA ATAAAGAGAT AAATTCTCTC 2820 

TACCTTATAG GATATTTACT AGCTGCTATG AAGATAACAG ACTCTTTAGA TGCATCTAAA 2880 

GAGGGCTTGA TGGAAATATT TTATTTATCG CCCAAAATAG AAAGATTAAA AGAAATTCAA 2940 

AATCAAGATT TACAAGAAGG CGATGACTAT AGCTTAAAAA AATTTGATAT TGATCTAAAA 3000 

GATGTTGAGT TTGCCTACAA TAAAGACGCA AAAGTTTTAA ATGGTGTAAG TTTTAAAGCT 3060 

AAGCAGGGAG AGGTCACTGC TTTGGTAGGT GCAAGTGGCT GCGGTAAAAC AACTATCTTG 3120 

AAACTTATAT CAAGACTTTA TGATTATGAC AAGGGACAAA TCTTAATCGA TGGCAAAGAT 3180 

ATAAAGGAAA TATCAACAGA ATCCCTTTTT GATAAGGTGT CTATTGTTTT CCAAGATGTG 3240 

GTTCTCTTTA ATCAAAGCGT TATGGAAAAT ATTAGAATCG GTAAGCAAGA TGCAAGTGAC 3300 

GAAGAGGTTA AAAGAGCAGC AAAACTTGCA AATTGCACAG ATTTTATAGA AAAAATGGAT 3360 

AAAGGTTTCG ATACAGTTAT TGGTGAAAAC GGAGCTGAGC TATCAGGAGG AGAAAGACAA 3420 
AGATTATCAA TAGCCAGAGC CTTCTTAAAA GATGCGCCGA TATTGATCTT AGATGAGATA 3480

ACAGCAAGCC TTGATGTTAA CAACGAGAAA AAGATTCAAG AGTCTTTAAA TAATTTAGTT 3540 

AAAGATAAAA CTGTTGTAAT CATTTCACAT AGAATGAAAT CCATAGAAAA TGCAGACAAG 3600 

ATAGTAGTTC TTCAAAACGG AAGAGTAGAA AGCGAAGGTA AGCATGAAGA GCTTTTACAA 3660 

AAATCAAAAA TTTACAAAAA TTTAATAGAA AAGACAAAAA TGGCAGAAGA ATTTATTTAT 3720 

TAGGAGGACT ACAATGGATA ATAAAAAATT AAAAGTAAAA GATTTAGTAA GCATCGGTGT 3780

TTTTGGCGTA ATTTATTTTG CCTTCATGTT TGGAGTTGGT ATGATGGGCT TGATTCCAAT 3840 

ATTGTTCTTA ATATACCCGA CAGTATTAGC CATAGTTGCA GGAACTGTTG TTATGTTATT 3900 

TATGGCTAAG GTTCAAAAGC CATGGGCACT ATTTATATTT GGTATGATAT CACCACTTGT 3960 

GATGTTTGCA GCTGGTCATA CCTACGTAGT TGTGGTTTTA TCACTTATAG TAATGATAAT 4020 

AGCAGAATTA ATTAGAAAGA TTGGTAATTA TAATTCATTT AAATACAATA TGCTTTCTTA 4080 

TGCAATCTTC AGCACATgGA TATGTAGCTC TTTAATGCAA ATGCTTTTAG CAAAAGAAAA 4140

ATATATGGAG TGGTCTTTGA TGACTATGGG AAAAGATTAT GTTGATGTAT TAGAAAAGTT 4200 

AATAACTTAT CCTCACATGG CTTTAGTAGC CTTAGGTGCT TTCTTAGGAG GAATTCTTGG 4260 

AGCATATATA GGCAAGGCTC TATTGAAAAA ACACTTTTCA AATGGATTAT ATTGTGTGGG 4320 

ATACTTTACT CCTTGCCTAA TTTTATGGTG CTATCTGAAT TAAACCCTAT AGTTAAGATG 4380 

TTTTTGAGTA TACCTATTGT TATTAGAATG TTTATTTTAC CATTTATGGC AGCAAGCTTT 4440 

ATGATAAAGA CCTCGGATGT AGGCGCAATA ATTTCATCGA TGGATAAGCT TAAGATTTCA 4500 

AAGAATGTAT CCATACCTAT TGCGGTTATG TTTAGATTCT TCCCATCTTT TAAGGAGGAG 4560 

AAGAAAAACA TCAAAATGGC TATGAGAGTA AGAGGGATAA ATTTTAAAAA CCCAGTCAAA 4620 

TATCTTGAAT ATGTTTCTGT GCCACTACTC ATTATATCAT CTAATATATC AGATGACATT 4680

GCAAAAGCGG CAGAAACAAA GGCAATAGAA AATCCAATTG CCAAGACCAG ATACATTCGC 4740 

GTAAAGATAC AGCTAATTGA TTTTGTTTAT GTTTTAGCGG TTGCTGGACT TATTGTGGGA 4800 

GGCTTAATAT GGTTGAAATA AAAAATTTAA GTCTTGATTA TGGTGAAGAG CATATATTAG 4860 

ATGATATATC ACTATCCATA GCCGAGGGAG AGTGCGTGCT ATTTACAGGA AAAAGTGGAA 4920 

ATGGTAAGTC ATCTTTAATA AATTCAATCA ATGGACTAGC TGTAAGGTAT GATAACGCAA 4980 

AGACAAAGGG CGAAATAATT ATTGATGGTA AGAATATAAA AAATTTGGAA CTTTATCAAA 5040 

TCTCAATGCT TGTTTCAACT GTTTTTCAAA ATCCTAAGAC ATATTTTTTT AATGTCAATA 5100 

CGACATTAGA ATTATTATTT TATTTGGAAA ATATCGGTCT TGCAAGAGAA GAGATGGACA 5160 
GGCGTTTGAA GGATATACTT GAGATATTCC CGATAAAAAA TCTTTTGAAC AGAAATATAT 5220

TTAATCTATC CGGCGGTGAA AAACAAATTC TTTGCATTGC AGCTTCTTAT ATAGCAGGTA 5280 

CAAAGATTAT AGTTATGGAT GAGCCTTCAT CGAATTTAGA TATTAAAAGC ATAAGTGTTT 5340 

TGGCAAAGAT GCTAAAGATA TTAAAAGAGA AAGGCATAAG CATAATTGTT GCAGAGCATA 5400 

GAATTTATTA TTTGATGGAC ATAGTTGACC GTGTATTTTT AATAGATAAA GGAAAGCTTA 5460 

AAAAAACTTA TACTAGAAGT GAATTTTTAA AGCTAGATAA AAATGAATTA AATGCTTTAA 5520 

GTTTAAGAGA TAAAGAATTA AGTAAATTAA AAGTTCCTTA TTTAAAAGAA GGTGGAGAGT 5580 

ATCAGATAAA AAATCTTAGT TACAAATTTA CTGATGATGA GTGTTTAAGC TTAAAAGATA 5640 

TTTCGTTCAA GCTTGGGAAA ATTTATGGCA TAATAGGATC CAACGGACGA GGAAAATCAA 5700 

CGCTTTTAAG ATGTTTAATA GGTCTTGAGA AAAAATCAAA AGAAGAAATT TATTTTAAGG 5760 

GAGAGAAGCT ATCTAAAAAA GAAAGACTCA AAAACTCTTC ACTTGTTATG CAAGATGTAA 5820 

ATCATCAATT ATTCACAGAT GAAGTATTCA ACGAGCTTAG ATTAGGAGTA AAGAATTTTG 5880 

ATGAAGAAAA GGCGAAAATC ATTTTAAACC CCAATTATTC ACCCCAAATC TAAAAACCAT 5940 

CCAGAATCCT TGCCTTAGCT TAGATCCTGG ATGGTTTCTT TTTTCACCCA ATGGGTGTTT 6000 

TTTACTAGAC AAAAAAGAGT TTCCCCTTTA TGGTATAAGT GTAGAAAAAA ACACAAAAAG 6060 

AAAGGAAACT CACATGAACA GTTTACCAAA TCATCACTTC CAAAACAAGT CTTTTTACCA 6120 

ACTATCTTTC GATGGAGGTC ATTTAACCCA GTATGGTGGT CTTATCTTTT TTCAGGAACT 6180 

TTTTTCCCAG TTGAAACTAA AAGAGCGGAT TTCTAAGTAT TTAGTAACGA ATGACCAACG 6240 

CCGCTACTGT CGTTATTCGG ATTCAGATAT CCTTGTCCAG TTCCTCTTTC AACTGTTAAC 6300 

AGGTTATGGA ACGGACTATG CTTGTAAAGA ATTGTCAGCT GATGCCTACT TTCCAAAATT 6360 

GTTGGAAGGA GGGCAGCTTG TTCACAGCCA ACCTTATCCC GTTTTCTTTC CAGAACTGAC 6420 

GAGGAAACAG TCCATAGTTT GCGATGCCTC AACCTTGAAT TGGTCGAATT CTTTTTACAT 6480 

GTTCACCAGC TG 6492 
(2) INFORMATION FOR SEQ ID NO: 189: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 7174 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 189: 
AACTGAAGGT AAAGGCTTCG ACGCAGAACG TGACGCTGCC CAAGCTGCCC TTGATGACCT 60

TAAGAAAGCT CAAGAAGACA ACAACTTGGA CGACATGAAA ACAAAACTTG AAGCATTGAA 120

CGAAAAAGCT CAAGGACTTG CTGTTAAACT CTACGAACAA GCCGCAGCAG CGCAACAAGC 180 

TCAAGAAGGA GCAGAAGGCG CACAAGCAAC AGGGAACGCA GGCGATGACG TCGTAGACGG 240 

AGAGTTTACG GAAAAGTAAG ATGAGTGTAT TGGATGAAGA GTATCTAAAA AATACACGAA 300 

AAGTTTATAA TGATTTTTGT AATCAAGCTG ATAACTATAG AACATCAAAA GATTTTATTG 360 

ATAATATTCC AATAGAATAT TTAGCTAGAT ATAGAGAATT ATATTAGCTG AACATGATAG 420 

TTGTATCAAA AATGATGAAG CGGTAAGGAA TTTTGTTACC TCAGTATTGT TGTCTGCATT 480 

TGTATCGGCG ATGGTACCAG CTATGATATC ATTAGAAATA CAAACATATA AATTTGTAAT 540 

ACCGTTCATA ATTGGTATGA TTTGGACAGT AGTTGTATTT CTTATGATCA ATTGGAATTA 600 

TATAGGCAAA TACTAAGAAG AGACAAAAAT ATATAAATAT TTCTGTACTT ATAGGATATT 660 

TAAAATCAAA ATAAAGTTAA TTTACTTATT TGCAGAGGTT GCAACCCAGC CTCTGTTTTT 720 

CGATAAAAAG GGACGGAATC TCATTTGTTT GGGTTTTGTC TCATCAATAG AAAGGAACAA 780 

AGAGTGTTCG TAACTGAACA CGGGTTTCAG AATTTCTTAC TAAATATAAA AGAAAGGAAT 840 

TGAACCCGAC CTAAATGGTG GTTCGATTCA GAACATCAAT AGAAAGGAAT AAGGGTGTTC 900 

GTAACTGAAC ACGGGCTATG GACTGTGCCA AAAAGATAGT TTTTTCTAGG ACGTAAGCGT 960 

CCGTCGTCAA AACTCCTAGA TGGCTGTGTC CGTTTGACGC CCTTTGTATC TTGAATTATG 1020 

AACAATACTG AATTTTATGA TCGTCTGGGG GTATCCAAAA ACGCTTCGGC AGACGAAATC 1080 

AAAAAGGCTT ATCGTAAGCT TTCCAAAAAA TATCACCCAG ATATCAACAA GGAGCCTGGT 1140 

GCTGAGGACA AGTACAAGGA AGTTCAAGAA GCCTATGAGA CTTTGAGTGA CGACCAAAAA 1200 

CGTGCTGCCT ATGACCAGTA TGGTGCTGCA GGCGCCAATG GTGGTTTTGG TGGAGCTGGT 1260

GGTTTCGGCG GTTTCAATGG GGCAGGTGGC TTCGGTGGTT TTGAGGATAT TTTCTCAAGT 1320 

TTCTTCGGCG GAGGCGGTTC TTCGCGCAAT CCAAACGCTC CTCGCCAAGG AGATGATCTC 1380 

CAGTATCGTG TCAATTTGAC CTTTGAAGAA GCTATCTTCG GAACTGAGAA GGAAGTTAAG 1440 

TATCATCGTG AAGCTGGCTG TCGTACATGT AATGGATCTG GTGCTAAGCC AGGGACAAGT 1500 

CCAGTCACTT GTGGACGCTG TCATGGCGCT GGTGTCATTA ACGTCGATAC GCAGACTCCT 1560 

CTTGGTATGA TGCGTCGCCA AGTAACCTGT GATGTCTGTC ACGGTCGAGG AAAAGAAATC 1620 

AAATATCCAT GTACAACCTG TCATGGAACA GGTCATGAGA AACAAGCTCA TAGCGTACAT 1680 

GTGAAAATCC CTGCTGGTGT GGAAACAGGT CAACAAATTC GCCTCGCTGG TCAAGGTGAA 1740 

GCAGGCTTTA ACGGTGGACC TTATGGTGAC TTGTATGTAG TAGTTTCTGT GGAAGCTAGC 1800 
GACAAGTTTG [illegible] [illegible] TTCTACAATC TCAACCTCAA CTTTGTCCAA 1860

[illegible] [illegible] AGATATTCCA ACTGTTCACG GTGATGTTGA ATTGGTTATT 1920

[illegible] [illegible] TAAGAAGTTC CGCCTACGTA GTAAGGGGGC ACCGAGCCTT 1980

[illegible] [illegible] CCAATACGTT ACTGTTAATG TCGTAACACC GACAGGCTTG 2040

[illegible] [illegible] CTTGAAAGAA TTCGCGGCTG CTGGTGACTT GAAAGTAAAT 2100

[illegible] AAGGCTTCTT TGACCATATT AAAGATGCCT TTGATGGAGA ATAATACTCT 2160

[illegible] [illegible] CGTCAGCGTT GCCTTGCCGT ATATATGTGA CTGACTTCGT 2220

[illegible] [illegible] AAACAGTGTT TTGAGCAGCC CGTGGCTAGT TTCCTAGTTT 2280

[illegible] TTATAGATTT TTTAAGACTT TCCTAAGTAA TGACGGACGG TAGTGACCTC 2340

[illegible] CCATACCTAA ACTTTGAACC TAAGTTTTAA AGTTTCCGGA CAGCTGAAAC 2400

[illegible] CAGGTGTTTT CATTACGGCA GAAAGTCTTC GATTTAGTTG TGAAATGGTG 2460

[illegible] TTCAAAAATT TCTTCAAACC ACGTCAGCGT CGGCTTGTCA TGGGTATGGT 2520

[illegible] [illegible] TCCACAACCT CAAAACAGTG TTTGAGCTGA CTTCGTCAGT 2580

[illegible] ALCTTAAAAC GGTGTTTTGA GCAGTCTGTG CCTAGCTTTC TAGTTTGCTT 2640

[illegible] ATTGAGTATG AATTACCTAA ATTATGATGC ATAGTTGATG GGATATATAT 2700

[illegible] AATAGAATAT GAACAAATTG ATAAGAGGAT TTTAAAGTAA TCTCTAACAA 2760

[illegible] ACTATGGTGT GCTATTCTAA ATTCAATTCA CTATAACTTG TTTACGTTTT 2820

AAAAAAtjAGC CGTCGGGCTC TTTTTACTTA TCTTCAGTTC CCTGCATTTC TTTTATCACA 2880

[illegible] TCTGGATATC CTTTTCCAAG ACCTTAAACT TGTAAGTCAA GTCTTCTTGG 2940

[illegible] TAAGTTCTTT TTGCTGGTTA ATGATTTGCA GGCTGTTTTG GATAATATCC 3000

[illegible] I^ATAGCTTG AACGCGGTCA GTGGTATTCA AGACTTCATC TGTGATGGTT 3060

[illegible] [illegible] ATAACTTCCG GCTGCAGCTC CTGCAAATAG CAGTAGGTTG 3120


GATAATTTCA TAGCAACTCC TTAAGCGTTT TTGATGGTTT CAGCGACTTG AGCAAGTTTG 3180

TCAAAGTCTG GTTCGTGGGC GATAAAATCA ATCTTGAGGT CATCGTCAGC ACTGTAGCGA 3240

GGCACAAGGT GAACGTGAGT ATGAAAAACT GTTTGACCAG CGACTTCTTC ACAGTTGGAA 3300

ATGATATTCA TACCAGCAGC CTTAGTGACT TTCATGACTT TTTGAGCTAC TTTTGGTACT 3360

TGGGCAAAGA GTTGGCTGGC GCTCGTAGCA TCCATCTCCA AAAGATTGCG ATAGTGTTCT 3420

TTTGGCACGA CCAAGGTGTG TCCTAGTGTT ACTTGAGAGA TATCAAGAAA GGCAAGGACC 3480

TGCTCATCTT CATATACTTT TGAAGCAGGA ATTTCCCCTG CGATGATTTT ACAAAAAATG 3540

CAATCTGACA TAAAATCTAC CTCTACTGTA CTGAATTTTG ATATAATATA GCTACATTAT 3600
ACCAGATTTG GAGAAAATAT GTTAGAAATT AAAAACCTGA CAGGTGGCTA TGTTCATGTT 3660

CCTGTTTTGA AAGATGTGTC CTTTACTGTT GAAAGTGGGC AGTTGGTCGG TTTGATTGGT 3720

CTCAATGGTG CTGGGAAATC AACGACGATC AATGAGATTA TCGGTCTGTT GGCACCTTAT 3780

AGTGGCTCCA TCAATATCAA TGGCCTGACT CTGCAAGGAG ATGCGACTAG CTACCGCAAG 3840

CAGATTGGCT ACATTCCTGA GACGCCTAGT CTGTATGAGG AATTGACCCT CAGAGAGCAT 3900

ATCGAAACGG TTGCTATGGC TTACGGTATT GAGCAAAAAG TGGCTTTCGA ACGAGTAGAG 3960

CCCTTGTTAA AAATGTTCCG TTTGGAACAG AAATTAGACT GGTTCCCTGT TCATTTTTCA 4020

AAAGGGATGA AGCAGAAGGT CATGATTATC TGTGCTTTTG TGGTGGATCC AAGTCTTTTC 4080

ATCGTGGATG AGCCTTTCCT TGGTCTTGAT CCGCTGGCTA TTTCTGATTT GATTCAGCTT 4140

TTGGAAGTGG AGAAGCAAAA GGGCAAGTCT ATTCTCATGA GTACCCACGT GCTGGATTCG 4200

GCGGAGAAGA TGTGTGATGC CTTTGTCATT CTTCACAAGG GAGAGGTGCG TTCCAAAGGC 4260

AATCTCCTGC AACTACGTGA AGCCTTTGAT ATGCCTGAGG CTAGTTTGAA TGATATTTAC 4320

TTGGCTCTGA CCAAAGAGGA GGATCTATGA AAGACTTGTT TTTAAAGAGA AAGCAGGCCT 4380

TTCGTAAGGA GTGTCTTGGT TATCTGCGCT ATGTGCTCAA TGACCACTTT GTCTTGTTCC 4440

TGCTTGTCCT GTTGGGCTTT CTAGCCTACC AGTACAGTCA ACTCTTACAA CATTTTCCTG 4500

AAAATCATTG GCCTATCCTT TTGTTTGTAG GAATTACGTC TGTTTTACTT TTACTTTGGG 4560

GAGGAACTGC CACCTATATG GAGGCTCCAG ACAAGCTCTT TCTCTTAGTT GGAGAAGAGG 4620

AAATTAAGCT CCATCTCAAG CGTCAAACTG GCATTTCCCT AGTCTTTTGG CTCTTTGTAC 4680

AGACCCTTTT CTTGCTGTTA TTTGCGCCTT TATTTTTAGC AATGGGTTAT GGCTTGCCAG 4740

TTTTTCTGCT CTATGTGCTT TTATTGGGGG TAGGAAAATA TTTCCACTTT TGTCAAAAGG 4800

CCAGCAAATT TTTCACTGAA ACTGGACTGG ACTGGGACTA TGTTATTTCT CAAGAAAGCA 4860

AGCGTAAGCA AGTCTTGCTT CGTTTCTTTG CCCTCTTTAC GCAGGTCAAG GGAATTTCAA 4920

ACAGCGTTAA GCGTCGTGCC TATCTGGACT TTATTTTAAA GGCTGTTCAG AAGGTGCCTG 4980

GGAAGATTTG GCAAAATCTC TATCTGCGTT CTTATCTGCG AAATGGCGAC CTCTTTGCTC 5040

TCAGTCTTCG TCTTCTCTTG CTTTCCTTGC TGGCGCAGGT TTTTATCGAG CAAGCTTGGA 5100

TTGCGACAGC AGTGGTAGTT CTCTTTAACT ACCTCTTGCT CTTCCAGTTG CTGGCCCTCT 5160

ATCATGCCTT TGACTACCAG TATTTGACCC AACTCTTTCC GCTGGACAAG GGGCAAAAGG 5220

AAAAAGGCTT ACAGGAGGTA GTTCGAGGAT TGACCAGTTT TGTTTTACTT GTGGAATTAG 5280

TTGTTGGGTT GATTACCTTC CAAGAAAAAC TAGCCCTTCT AGCCTTACTA GGAGCTGGTT 5340
TGGTTTTACT AGTCTTGTAT TTGCCTTATC AGGTAAAACG TCAGATGCAG GACTAACATT 5400

GCTGATACGA CACTAAAAAA GAAGTTGAGT TCAGTCTGTC TCAACTTCTT TTTTGTTACT 5460

ACAGGATAAT GGTTGGTCCG TAGAGACTTA TACTCTTCGA AAATCTCTTC AAACCACGTC 5520

AGCGTCGTCT TACCGTACTC AAGTACAGCT TGCGGCTAGC TTCCTAGTTT GCTCTTTGAT 5580

TTTCATTGAG TATTAACTTG GTCTTGACTT GGTCAAAGTG GAAGCGGTCA TAGGCCCGCC 5640

AAGCGGCGCG AGTTGGAGCA TCTGGATCAA GAGCGCTGAG TCCCATGAGA AGACTGGAAG 5700

TCTGGTAAAA TTTTTCTAGT TCAATCAAGA ATCGATTATC CACTGTTTCA GCCTTGGCTA 5760

GAAAACCAAG AATAGAGTTT AATTGCTCCT GAAAGCGGAC GTCGTCAGCG CTTGCCTGTT 5820

TGCATGCTTG GTAGGCTTTG TTTAAGTCAG TAATCAAAGT ATGAGCTCTT TTGATGGGGT 5880

CTGTATCTGT CATGGGAATG CCTCCTTTAA TCTGGGTGCC AGTCTTACTT CTGGCAACTG 5940

TGTTTTGATA CTGTTAGTTT ATCACTTTTA ATTCTTTTTT TTTATTCAAA TCTTTAATTG 6000

TCATTGAAAT GTCTTGAATT GCGCTGAGTG AATTTTATGA TAAAATAGTT GTAAGCTCAT 6060

CATGATGTTG TAGAAAATAA TCCTTTTAGG AGTTTTCAAA GACTGTTTAG GATTGGGTGT 6120

GCTTGGGCTA GACCTTTTCT GTTATTCTTT TCTTAGGAGG AGAATCCAAT GAAATATATG 6180

ATTATTCAGA CGCAGAAAAC AGTCTATAAA GTAAACATCG ACGATATCTA CTATATCCAA 6240

ACACATCCAA CTAAAGCCCA TACCGTACAG ATTGTTACAG AAGAAGCTAG TTTTAATATG 6300

CTTCAAAATT TAAGTAATCT TGAGAACCAA TGTGGGGAAA CCTTGATGAG ATGTCATCGA 6360

AATTGTTTGG TTAATCTTGA TAAATTAAAA TCGATTGATT TTCAAGAAAG AATCCTTTTT 6420

CTCGGAGAAG AAGGTCAATA CGCTGTCAAG TATGCCAGAC GTCGCTATAG AGAAATTCGT 6480

CAAAAATGGT TGAAAGAGGG AGAGTAAGAA GATGAGAATA TTTGTTTTAG AGGATGATTT 6540

TTCCCAACAG ACTAGAATTG AAACGACGAT TGAGAAACTT TTGAAAGCAC ATCATATCAT 6600

TCCTAGCTCT TTTGAGGTAT TTGGCAAGCC GGACCAACTG CTGGCTGAAG TGCATGAGAA 6660

GGGGGCCCAT CAGCTATTCT TTTTGGATAT TGAGATTCGA AATGAAGAGA TGAAGGGACT 6720

GGAAGTGGCT AGAAAGATTC GGGATCGGGA TCCTTATGCC CTGATTGTCT TTGTGACGAC 6780

TCACTCGGAG TTTATGCCCC TGTCTTTTCG CTACCAAGTG TCTGCTTTGG ACTACATTGA 6840

TAAGGCCTTG TCAGCAGAGG AGTTTGAATC TCGGATCGAG ACAGCCCTCC TCTATGCCAA 6900

TAGTCAAGAT AGTAAAAGTC TGGCGGAAGA TTGCTTTTAC TTTAAATCAA AATTTGCCCA 6960

ATTTCAGTAT CCTTTTAAAG AGGTTTACTA TCTCGAAACG TCGCCCAGAG CCCATCGTGT 7020

TATTCTCTAT ACCAAGACAG ACAGGCTGGA ATTTACAGCG AGTTTAGAGG AGGTTTTCAA 7080

GCAGGAGCCC CGTCTCTTGC AGTGCCACCG CTCTTTTCTC ATCAATCCTG CAAATGTGGT 7140

GCATTTGGAT AAGAAAGAAA AACTGCTTTT CTTT 7174
(2) INFORMATION FOR SEQ ID NO: 190: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3207 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 190: 

CCACCAGGGA AAATCATTGA AGTTGGTAGT CACCAAGAGT TAATGCAGGC GCAAAGTTTC 60 

TACCATCATC TATTCAATAA ATAAGGAGAA TGTCATGAAT CCTAATCTTT TTAGAAGCGT 120 

CGAGTTTTAT CAGAGACGTT ACCATAACTA TGCGACAGTG TTAATTATAC CTCTTTCATT 180 

ACTATTTACT TTCATCTTGA TTTTCTCCCT TGTTGCCACA AAAGAAATTA CTGTTACTTC 240 

CCAAGGAGAA ATCGCCCCTA CAGTGTCATT GCCTCCATTC AGTCAACCAG TGATAATCCT 300

ATCCTAGCTA ATCATTTAGT GGCAAATCAA GTAGTTGAAA AAGGGGACTT ACTCATCAAA 360

TACTCTGAAA CAATGGAAGA AAGTCAGAAA ACTGCCTTAG CAACTCAATT ACAAAGACTT 420 

GAGAAGCAAA AAGAAGGACT TGGAATTTTG AAACAAAGCT TAGAAAAAGC GACTGATCTT 480 

TTTTCTGGCG AGGATGAATT TGGCTACCAT AATACCTTTA TGAATTTTAC TAAACAATCC 540 

CATGATATTG AACTGGGTAT CACAAAGACT AACACCGAAG TTTCAAATCA AGCTAATCTT 600 

TCCAATAGCA GTTCATCAGC TATTGAACAA GAAATTACAA AAGTTCAACA ACAAATTGGA 660 

GAATATCAAG AGTTGAGAGA TGCTATCATA AATAACAGAG CACGCTTACC AACTGGCAAT 720 

CCGCACCAGT CAATTTTGAA TCGTTATCTT GTAGCCTCAC AAGGACAAAC ACAAGGAACT 780 

GCAGAGGAGC CATTTTTATC TCAAATTAAT CAAAGTATTG CAGGTCTTGA ATCATCTATC 840 

GCAAGCCTCA AAATTCAGCA AGCTGGTATC GGAAGTGTAG CAACTTATGA TAACAGTTTA 900 

GCAACCAAAA TTGAAGTACT CCGCACTCAG TTTTTACAGA CAGCCTCACA GCAACAACTA 960 

ACTGTGGAGA ATCAATTAAC AGAATTAAAA GTACAACTAG ATCAAGCCAC ACAGCGTTTG 1020 

GAAAACAATA CCTTAACCTC CCCAAGTAAA GGTATCGTTC ATCTGAACAG CGAATTTGAA 1080 

GGTAAAAATA GAATTCCAAC TGGTACAGAA ATTGCTCAAA TATTCCCTGT CATCACAGAT 1140 

ACAAGAGAAG TACTAATCAC TTACTACGTA TCTTCTGACT ATCTACCTCT ACTAGATAAA 1200 

GGACAAACTG TAAGATTAAA ACTGGAGAAG ATTGGAAATC ACGGCACCAC CATCATCGGC 1260 

CAACTTCAGA CAATTGATCA AACTCCTACC AGAACAGAGC AAGGAAATCT CTTTAAATTA 1320 
ACCGCTCTTG CAAAACTATC TAACGAGGAT AGTAAACTCA TCCAATATGG CTTACAAGGT 1380

CGCGTCACTA GTGTAACTAC AAAGAAAACA TATTTTGATT ATTTCAAAGA TAAAATTTTA 1440

ACACATTCTG ATTAATTTTC AGATAACACT CTATAACTAT TTATTATCTT ATCAAAAAGG 1500

AGAATCATAA CATGGATAAG AAACAAAACC TAACTTCATT TCAAGAACTA ACAACTACCG 1560

AACTCAATCA AATTACAGGT GGAGGATTGT GGGAAGATTT ATTATATAAC ATTAATAGAT 1620

ATGCTCATTA CATCACATAA GAACTTCATC ATCCAATACA ACTATAAAAA AATAAGACCG 1680

AGAAACAAGT ACTCTCGGTC TTATTTTTCA TCATTCTGTA TGTATCACAG TAAGTACCTG 1740

ACGAAAGACT TGATTTTGAC AGGTGGTATT TAGACTGGTA TTAGGATGGC TTTCCACAAT 1800

CTTCATGACG GTATAGAGAC CAACTCCTCT CTCCTCCCCT TTAGAACTGG CTCCAAAGGA 1860

GAAGATTTCA GAAATATCGA TGCCCTCTTC TTTGATGGAG TTTTCGATGA TAAAGGTCTC 1920

CTGTGCTCCA TTTTTTAAAA AGGCGATTGA AACATGAGGT TGACTAGCTT CCACACTGGC 1980

TTCAATAGCA TTGTCACAAA GGATAGACAC AATGGTTAGA AAATCAAGTA GACTCATCCC 2040

CTCGACCTGA ATCTCCTCAG GAACTTCGAC ATTAAAGACA ATGTTCTTAT CTCTGGCTTT 2100

TAAAAATTTC CCTGCTAGAA GACTTTTGAG GGCTTTATCA CGAATATTTA CCAATCTGCC 2160

CAGGTCATAT TTATTGTTCT GCAATTTCTG ACTGGAATCC TTTAAGACGG AGCCATAGAC 2220

CTCTTTTATC TGCTCCATAT CCTCCTCTTC AATGCCCAGA CGTAAGCTAG TCAAGAGGTT 2280

GGTATAATCA TGACGAAAGC TCCGTACTTC CTTGTAAAGC TCCTCTATAT GCCGACTATA 2340

GCGTTCCATA TCTCTATAGC GCAGGGCCTG CTCTTGTTCC AATCTCTCAT AGAGTTTTTC 2400

CTTCAAATAG GTATCCAATT TCTTGATAAC CCCCATAAAA AAGAGTAGGT AAAAGACTAG 2460

GATGAGATGG CGAACAGTCT TTGATTGAAT ACTTTGTTCA TATTCAAAAA AAGACAGACT 2520

TTCCATGACT AGATAGTAGC CACCCATTAT CCAGTTAATC TGAGTCAGGG ACTTTTGAAA 2580

GGCTTTATCG AGAATCTCCT TTCTCAAGCT AGTAAAATCG TAGTCCAACC ATTTCAAAAA 2640

AGCTAGAGAA ATGAAGAAAT TGAAAATTAT TATACATAAC CCAGTAAATG AGTAGCCATC 2700

ATATACTTGC CCTTGTCCCA AAAATGGAAG CACAAAATAG GAGACTCCTC TATAAAAGAG 2760

ATTCACCAAT ATCATTGGAA AGAGACCATA AAAGAAAAGG AGTTTTTTAG GAAGCCCTCT 2820

CAATAATAAG AAAGATAAGC CTATGCCGTA CAAGGGTTCC ATAAAATAAG ATAGGTAAAC 2880

ATTTCCTACT ATATAGCTAA TCATCACAAA AACAAAGGCC AACAGTATCT TCAAAAGAAA 2940

GGCCTTAAAA ATCCTCTCGA AAGTAAGATC AATTCCATCC ACCTTAAAGA AGATGACAAT 3000

TTCTAGTCCA TTAGTAACAA GTGTATACAA CAATATCCAA GCAATGTTCA TAAATTCTCC 3060

TAGCTCAGTG TAATTTATTG ATGGCCTCAG ACACTTCCCT GACCTTATAA CGGGCGATTA 3120

GACAACTTCC ACCATTGGGA GAGAAGAGCA GTTTTTCTTT CTTATCCAAA TGCACCACAT 3180
TTGCAGGATT GATGAGAAAA GAGCGGT 3207 
(2) INFORMATION FOR SEQ ID NO: 191: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 10357 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 191:

CTGAATCAAG TGTACTGCAC CAGTTCGTGC ATCAGGCATA ACAACATCTA CAGATATAAT 60 

ATTGTTTTCT GAGTCCGCCT CATAAGTTAA AATCATAAAT TTTTCGATAT TCGAATTTTT 120 

AGTAGCTTGT TCAATTTCTT GAATCATTTC ATCAGAAACT AACTCCATCT GAATTGGAAA 180 

GGAATGACTA TTTTCATCAT TTTTGTAGGA AGAATGTTGA TTAAGATAAA GTGTATTCAT 240 

CTGAGCATAT TCAAATAAGT AGCCACTCTT ATTTTTTTGT ACCAAAGGAA ATTGGTTTGT 300 

AAGTCGCTTC TTACCCTTTA TAATTAACAA TACTTTCCCA TATTTTTCTG TATTTGTTTC 360 

AAATTCTAAA TATCCCCAAG TCTGTCCTGC TAATTGTAAT TTATACTCAA ACAAATCTGC 420 

TGATGCAAAT GCAGTATCAA TATGATTAGG TCGCGTCCAT GCATAACCAT TCGACACTAT 480 

CATTGTCTCT CTTTTTTCTA GACGTTCATC TACATAATCT TTTTGCCCTT TCATCAAAGT 540 

ATCTACAATT TTTTGTGCCT CAAGCGAATC AAAGAGATCC TGATTCAACA TAATTCTTCC 600 

TCCTCCAAAT ACTTTTTAAT GAATTATACC ATTTTCTTAA AGAAATTACT ACAATAATTA 660 

TCTTTTTCTT AAAGTTCTGT GTCAGAGTAA TTTAGAAAAT TATATCTTCT ATAGTAAAAT 720 

CAATTAAAAA CTGAACAAAT TTATTGGGAA ATTCAAATCG CTTTCTGAAA ATATTTTAGG 780 

AACCGTAGTG TAATATTCCA GATTCAATTC ACTATAAAAC TGACCTTTCT CCTGCAAAAG 840 

AAAAAGGAAA GACTTCCTTT CGTGCCTTTC CTCTTACTTG CTACTTGTTT GATTATTTTT 900 

GGTAAGCTAC TGCTTGTCTG ATAAAATCCT GAATCGGCTC TCCTTGGTGG AGAGCTTTTA 960 

CTATTTTCGA ACCGACGATA ACACCATCTG ACACCGCATT GAAGCGTTCC AGATCGGCTT 1020 

GACTAGATAC ACCAAAACCT GTCAAGACTG GGATGTCGGC CACTTGATGA AGTTGCGCCA 1080 

AGTGCTTGTC CAAATCTGCA CGGTAATTGC CTGATTTCCC TGTCACTCCA TTGATGGCAA 1140 

CGGCATAGAT GAATCCCTCC GCCCCTTCAA TCAACTCTTT CTGGCGCTCA ATTCCTGTGG 1200 

TCAAGCTTAC TAAAGGAATC AAGGCGATAT CTGTATTTGC CAAAAATGGT TCTACAAAGT 1260 
TGGCATGTTC ATGAGGCAGG TCTGGGATAA TCAAGCCCTT CACAGCTGTA TCAGCCAGAT 1320

CTTTGACAAA GTTCTCCACA CCGTACTGAA AGAGGGGGTT GAAGTAGGTC ATGATGACCA 1380 

GTGGAATCTC TGTTTCAATG GTTTTCAAGG TTTCAACTAA AGCCTGGGTA GAGGTCCCGT 1440 

GGGCTAAACT GCGCAAGCCA GCTTCTTCGA TAACAGGTCC ATCTGCAACA GGGTCTGAAA 1500 

AGGGAATACC CACTTCAATT GCAGAGACAC CCAAATCTTC TAAAAAGTGA ATTGTTTCAG 1560 

CAAGACCGTC CAAACCTTTC TCGTGGTCAC CAGCCATGAT ATAGGGAACA AAAATTCCTT 1620 

TTCCAGCTGC TTTAATAGCA TTTAATTTTT CTGTTAGTGT CTTAGGCATG AGCTTCTCCC 1680 

TTCTTTGCTG CATCTGCTTC CAAGCGGTCC TTGACTTGAA CCACATCCTT GTCCCCACGA 1740 

CCTGATAGGC AGACAATCAT AGACTTTTCT GGTCCAAGTT CTTTGGCCAA TTTCACCGCA 1800 

AAGGCGATAG CATGGCTAGA TTCCAAGGCT GGGATAATCC CTTCCACACG AGACAAGAGT 1860 

TGGAATCCTT CCAAGGCTTC TTCGTCTGTC ACAGGGACAT AGCTGGCACG TTTAATATCG 1920 

TGGTAGTGAG AATGCTCTGG ACCGATACCA GGATAGTCCA AACCTGCTGA GATAGAGAAG 1980 

GCTTCAAGAA TTTGACCATG GGCATCTTGG AGCACATCCA TGAGGGAACC GTGAAGGACA 2040 

CCTGGACGAC CCTTGGTCAA GGTAGCTGCG TGGTGCTCTG TATCCACACC AAGCCCTGCT 2100 

GCTTCAGTTC CATACATAGC TACTGACTCA TCTTCTACAA AGGGATGGAA GAGCCCGATA 2160 

GCATTCGACC CACCACCAAC ACAGGCTACT AGGGCATCTG GCAGATCTCG ACCTGTCAAG 2220 

TCACGGTACT GTTGTTTAGC CTCTCGACCG ATGACACTTT GGAAGTCACG AACGATTTCT 2280 

GGAAATGGAT GAGGCCCCAA GGCAGAACCA AGGATATAGT GGGTATCGTC GATATTAGCC 2340 

ACCCATGAAC GAAGGGCTGC ATTGACCGCA TCCTTGAGCA CGCGCGAACC ATCTGTTACA 2400 

GCCTCGACCT TGGCTCCCAA AAGCTCCATG CGGAAGACAT TGAGGGCTTG GCGTTTGACA 2460 

TCTTCCTCAC CCATGTAGAT GGTACATTCC ATGTTAAAGA GGGCTGCAGC AGTTGCAGTT 2520 

GCCACACCGT GCTGACCAGC ACCCGTTTCT GCGATAATTT TCTTTTTACC CATGCGTTTG 2580 

GCAAGCCAAA CTTGTCCTAA GGCATTGTTA ATCTTGTGGG CTCCTGTATG GTTAAGGTCT 2640 

TCCCGTTTGA GATAAATCTT GGCTCCGCCA ATATGCTGGG TCAAGTTTTT TGCGTAATAA 2700 

AGAGGAGTTT CACGTCCTAC GTACTGGCGC AAAAGCTGGT TTAATTCCTC TTGGAAACTT 2760 

GGGTCTGCCT GACTTTCACG GTAGGCCTTC TCCAACTCCA AAACTGCTGT CATCAATGTT 2820 

TCTGGGACAA AACGTCCGCC GAATTTTCCG TAAAATCCAT CTTTATTTGG TTCCTGATAT 2880 

GCCATGCTTT ACCCTCTCTA TAAATCTTCT AATCTTTTCA TGATCTTTTT GTCCATCTGT 2940 

CTCCACTCCG CTCGATACAT CTACTGCATA GGGAGTAAAG TGTTGAATTG CTTTTACTAC 3000 

ATTATCTTCA TTAAGGCCAC CTGCGATAAA GAAGGGCTGT GCTAGTCCAG TCGTATCCAG 3060 



WO 98/18931 



PCT/US97/19588 






TTGACCCCAA TCAAAGGGCT GGCCACTTCC TGCCACAGGG GCATCAAAGA GTAGATAATC 3120

TGCCTGAGAA TTGGGGACAT GCCCATTTCC ATCTACCTGC ACAGCCTGAA TACTGGCACA 3180

AGGCAAATTC TCAAATAAAT CATCTGCCAC CTGACCGTGA ACTTGAACCA AGTCCAAGCC 3240

AACTTTGTCA ATCGCTTCCA GCAGTTCTAC CCGACTTGGT GAAACAAATA CTCCAACCTT 3300

TTTCACATCT GCAGGAATAA GCTTTGCCAA CTCAGCTGCC TCTTCTAAAG TCACCTGTCT 3360

TTTACTAGGT GCAAAGACAA AACCGATATA GTCGGCTCCT GCTGAAACGG CTGTTTCCAC 3420

CGCTTCTTTG GTCGATAGTC CACAAATTTT AACCTTTGTC AATCTGCAAC TCCTTGATTC 3480

TCTGGGCCAC ATTTTCTGCC TGCATAAGAG CTGTCCCTAC CAAAATTCCG TTAAAGTATG 3540

GGGCTAGTCG TTCCGCATCC TGCCCTGTGA AAATGGCAGA TTCAGAAATG TAATAGCGAC 3600

CTTCCTCAAA GTAAGGGGCT AAATCTACAC TGGTCTGCAA GTCGACCTCA AAGGTAGTCA 3660

AGTTGCGGTT GTTGACCCCG ATAATCTCAG CACCAAGTCT GTGGGCTACC TCTAGTTCAG 3720

CTAGATTGTG AGTCTCCACT AAGACTTCCA GACCAAGCTC TGTCGCGTAG TCATACAGTT 3780

CCTTGAGGCG TTCTTCGGAC AAGGCTGCCA CAATGAGCAA GATAACTGTC GCACCTGCAT 3840

TGCGAGCGCG GATGATTTGC TTTTCATCGA TGATAAAGTC TTTGTTGAGC GTCGGAATCT 3900

CTACCTGACT GGAAATTTCC CGTAGATAAT CCAAATGCCC TTTAAAGAAA ACCTCATCTG 3960

TCAACACCGA AATCATCACT GCTCCGTTTT CTTCATAAGT CTGGGCCTGT TGCACAATAT 4020

CCACATCGAG ATTGATATCT CCCAAACTAG GGCTAGCTTT CTTGACCTCA GCGATTACCT 4080

GCAAGCGGTC CTGATGATTC TTCAAAAATT CTGCCAAGCG ATAGGTCTGG CGCAGAGGCT 4140

GGATTTGCTC CAGCTTCATC TGCTCCACCT CACGCGCCTT CTGCTCTAAG ATTCGTGCTA 4200

AAAATTCCTG ACTCATTTTT GGTACTCCTG TAACAGTCTG AGTTTTTCAA GGGCCTTGCC 4260

TCTAGCAATC ACTTGACGGG CCAAGGCAAC CCCTTCCTTG ATGCTATCAA TCTTACCATT 4320

AGCATAGAAA CCAAGACCAG CATTCAAGAC TGTCGTTTCC AAGAATGGAC TTGCTTCGTT 4380

TTTCAGAACG CTAAGCAAAA TTTCTGCATT TTCCTGAGCA TTCCCACCAC GAATATCTTC 4440

CATAGCATAG CCTTCCATTC CCAAATCCTC TGGAGTAAAG CTTGACAAGC TGATTTCGCC 4500

ATTTTCAAGA AGTGCAATCT TGGTTGTTCC GTTCAAGCCA GCTTCATCCA ACCCTTCTGG 4560

TCCAGCAACC ACGATGGCAC GTTTGCGACC CATATTTTTC AAAACCTGAG CTGTACTTTC 4620

TAGGAGTTCT GGACGACTAA TTCCAAGAAG CTGTGTTTCT AAAGCCATTG GATGAATCAG 4680

TGGACCAGTC AAGTTCATAA TCGTTGGAAT TCCCAATTCC AAACGAGCTG GCATGATGTA 4740

TTTCATAGCT GGGTGCATAT TTTTAGCGAA GAGAAAGACG ATTCCAGTTT TATCAAAGAC 4800

CTTACCTAGT TCAGCTGGTT TGAGGTCAAG ATTGATTCCC AAGGCTTCGA GGACATCTGC 4860

GGAACCAGAT TTAGAAGATA TCGAGCGGTT ACCGTGTTTG GCCATGTGAA TACCGCCACC 4920 

AGCCAAGACA AAGGCTGCAG TTGTGGAAAT ATTAAAACTG AAAGACTTGT CCCCACCTGT 4980 

ACCACAGTTG TCCATGGCAT CATGAATCTC AGTTGGAATA TGCTGGGCAT GTCCTCTCAT 5040 

GACTTGGGCA ATGGCTGTGC GTTCTTGAGG TGTTTCCCCC TTCATCTTAA GAGCTAAGAG 5100 

GAGAGAAGCA ATCTGCGCTT CAGTTACACG CCCAGTTACG ATACGCTCAA TGACATCCGT 5160 

CATTTCCACA CCTGATAAAT TTTCAAATTT TGCTAGTTTT TCAATAATCT CTTTCATCCT 5220 

AGTTTCCTCA CTTTACAACC TCCTCGATAA AATTCCGAAT AGAAGACAAG CCGTCTGGCG 5280 

TTCCAATGCT CTCTGGATGG TACTGGAAGC CATAAATCGG TAGGTTTTTA TGTTGAATCC 5340 

CCATGATGGC TTGGTCATCA GTCGAACGAG CTGTCACTTC AAAGTCTTCT GGCATTTCCT 5400 

CAATCAAAAT ACTGTGATAA CGCATGACCG CACGGCCATC CTCAATACCT TGATACAAAA 5460 

CAGATGGCGC TTCAAAGTTG ATATTGCTCT GTTTCCCATG CATGACTTTT GGAGCCAAAC 5520 

CTAGCTTACC ACCAAAGACT TCTGCAATGG CTTGGTGGCC CAAACAAATC CCAAGAATCG 5580 

GCTTCTTGCC TGCAAAATCA CGAATCATGT CTTCCATCTT TCCAGCATCA ACTGGCCAAC 5640 

CAGGACCAGG AGAAAAGACC AGACCATCTG CTTTTTCAGC TTCTTCATAC AGCTTGGAAT 5700 

CATCATTTCT CAGAACCTGA ACTTCTGCAA AATTCCCAAT GTATTGGGCC AAGTTATAGG 5760 

TAAAAGAATC ATAGTTGTCA ATCAATAAAA TCATGGTCTT AGTTCTCCAA TTCTAGTCAT 5820 

AGATTTTGCT TTGTTAATGG TTTCTTGGTA TTCGTTTTGG GCGATAGAGT CGTAGACAAT 5880 

CCCTGCCCCA GCCTGCACAT AGGCTCTTTG ATTTTTGAGA ATCATGGTTC GGATGGCGAT 5940 

GGCCAAATCC ATATCACCCG TCGCAGACAA GTAGCCGATT GCCCCAGCGT ATACTCCCCG 6000 

TTTTTCCGTT TCCAGTTCAT AGATACGTCT CATCGCTCGA ATCTTTGGTG CTCCAGAAAC 6060 

GGTTCCAGCA GGAAGCGTTG CTTTCAAGGC ATCCATGGCA GTGAGTTCTG GAAGCAAACG 6120 

CCCCTTGACT ACGCTGGTCA AATGCATGAC GTAGCGGAAG AGCTCCACTT CCATATACTT 6180 

AGTGACTTGG ACACTGGTCG TTTCAGAGAT GCGGCCAATA TCGTTACGCC CCAAGTCTAC 6240 

CAACATTCGA TGTTCTGCTG TTTCCTTCTC ATCAGAGAGG AGGTCAGTCG CCAAGGCCTT 6300 

GTCTTCTTCA TCCGTAGCCC CTCTTGGTCG CGTCCCTGCA ATCGGATTGG TTGTCACGAT 6360 

GCCATTTTTG ACAGAAACCA AACTTTCTGG ACTAGCTCCG ATGATTTGAT AATCCCCAAA 6420 

ATCATAGAAA TAAAGGTAAT TAGAAGGATT AGTCACGCGG AGATTTCTGT AGAAGTCAAA 6480 

TGGATTTCCA GTAACTTCTG CTGAAAAACG CTGGCTGAGT ACACATTGGA ACATATCTCC 6540 

GTTACGAATC AAGTCACGAG CTGTTTCTAC CATTCCCTCA AACTTATGTG GAGCGATATG 6600 
CGGTTTGAAG TCTAACGGAG ATAGATCCAA ATCTTCAAAT TCATTTGGAG CAGGAATGCG 6660

TAATTCCTCA AGCACTTGGT TCAAGGATTT TTCCAAGGCC TCTTGACTGC GCTCACTATA 6720

AAGTGCATCC TCTATGACAT GTATCTTCTC CTTCTTGTGG TCAAAGACCA TATAGCTCTC 6780

ATAGACAAAG AAATGCATGT CTGGCGTCCC AATTGTATCC TCAGGGATTT GACCAATTTC 6840

TTCATAAAGC GAAATCATAT CGTAACCCAC AAAACCAATG GCTCCACCAC CAAAAGGTAG 6900

CTCTGAGTGG TGCTGACTCT TATGAATCAC TTCATAAAGG AAATCCAAGG GATCCCGATC 6960

AATCACTTGA CCATTTTGAT AGAGAACCCC ATTTTCAAAC TTAATCTCAA AAACTGGATT 7020

ATAGGCTAGG ATAGAAAAAC GAGCTGTTTC CTTGTCTCTC GGAATACTCT CTAAAATAAC 7080

CTTATGTTGC CCCTTTAAGC GCATATAAGC CAAGATTGGT GATAAGACAT CTCCATGAAT 7140

GATTCGTTCC ATTGTAATTT CCCTTTCAGT TCTACTTCTA GTCCGTGGTG ACTGTATGAA 7200

AAATCCCCAC GCAAAATAAC TTGCGTGAGG ACGAAATTCG CGGTGCCACC TCAATTATAG 7260

GATTTCTCCT ATCTCTCATT CCTGTCTCAG ATATCTCCTG TAACAGGCTG TGCGATAAAG 7320

GGCACTCCCT TGAGAATGAT GTTTTCTTCT CTCGTTTCAG ATGAACCCAA CTTTACAGCT 7380

TTCTCTGCTT GTTTTCAGCA ACCACAAGCT CTCTGTGAGA GAAAGAACTG TAATTTTTCC 7440

ATCTATTATT TTTTAGCTTC TAGTAGTCTG CAATCGCAGC TAGGTCCTTG CCTCCACGAC 7500

CAGAGACATT GATGAAGAGA TGTTCATCTC GGTACACCTT TATACTCTTC GAAAATCTCT 7560

TCAAACCGCG TCAACGTCGC CTTGCCGTAG GTATGGTTAC TGACTTCGTC AGTTCTATCT 7620

GCAACCTCAA AACAGTGTTT TGAGCTGACT TCGTCAGTTC TATCCACAAC CTCAAAACAG 7680

TGTTTTGAGC TGACTTCGTC AGTTCTATCC ACAACCTCAA AACAGTGTTT TGAGCTGACT 7740

TCGTCAGTTC TATCCACAAC CTCAAAACAG TGTTTTGAGC AGCCTGCGGC TAGTTTCCTA 7800

GTTTGCTCTT TGATTTTCAT TGAGTATTAC TAGCTTTTTT CGTATTAGTC CAGCCTTTTT 7860

GTTTGCTTTT AGTAGTAGGC ATGGAGCTGT AGATAGAACT CAAGTTCATC AAAGCGACTT 7920

AAGGCCCTAA TAAAAGATAA ACCAAACGAC GGATAGAAAA AAGCCCACAC ACAGAATATA 7980

CTTCCGTGTG AGGGCGTTGG TAACGCGGTG CCACCTCAAT TATAAAGGGA CTATCCCTTT 8040

ACATCTCTGC CTTGTTTAAC AACAAGCTGC ACTGTAAGGT GTGCGCACCG AATTTTCATT 8100

GTTTCAAATT CATTTTCAAA ATCAGCCCAC TTTCACTACT TCCAACCACC TATTCACAAT 8160

CACCACAGGC TCCCTGAAGA TCAAAAATAG TTACTTTTCT GATTTGTTGA ACTTATTTTA 8220

ATACTTTGTT TTTTCTTTGT CAAGACTTTT TTACGATTTT TTTGAAAATA TCATTCGAAT 8280

ATGACCATGT CTTCCTTAGA TCGAACATGA ACATGTCCCA CTTCTTAGAA ATTGGATCCA 8340

ACTCAATAGA AACTGAATGG AGGCTAAACA GAACTTATTT TAGAACACTC CATCTTTTCC 8400

ACTAGGATTT TCAAGAATTA AACAATACTA GAAACTCTGT CTCCTAACAA ATTTAGGAGA 8460 

AACTTCAACA GATGTGACAC TTTCCCCTTT AATAATTGCT AAAACACCTT CTATCATTTC 8520 

TTTAGCCAAT TTAACATAAT TGGGAGCAAT TGTAGACAAA GCTGGAGTAT AATACTGAGA 8580 

AATAGGAATA TTATCAAATC CAATGATAGA AATATCATCT GGAATAAGAA TTCCTTTCTC 8640 

ATAGCACGCA CGAATCAAGC CCTGAACCTT TTCATCTCCT GAAACAAAAA TAATGTCCGG 8700 

ATAATTTTGG GTAGTCAAGT GCTGCATTGC ATAAGAATAA ACTGAATCAA TTGTAGATAA 8760 

GCCATAAATG ACTTTTAAAT CCATAAAGTA ATTTTTATCA TTCAGAAAAG AACGCACACC 8820 

TCTTTCACGA TCCTTATTAA CATGGGATTC TCCTCCCATA AGCAACCACA TATTTTTAAA 8880 

TTTTTCTTCA GTTACAGCTT TCATCATATC ATAAGTAGCT TGAAAATTAT TATTAGATAC 8940 

ATAGACTACT CCAGACGTTT GAGATTCACC GAAAACAAGA AAAGGCATAT GGTTCTTCTT 9000 

TAAATACTGA ATTCTGATAT CATCTACACT TTCATAAAAA ACAATAACAC CATCTACTAG 9060 

GCTACCTGTG CTTGATATAA TTGAATTACT AATTGTATCC TCCTCTCCAA AGTACTCAAC 9120 

TATAGCATTA ACACCAAATT CTTTACACGT CCGTAACACT TTATCTAACA GCGTATGAAA 9180 

CCAAATTAAA GGAAAAGAGT CGATTTTTTT TACAGAAATC AATATATTTA TAGCTTCTTT 9240 

TTTAGTTAAA TTTTTTGCAT ACGCATTTGG AATATACGAC AATTCCTCTA TAACTTTTTG 9300 

AATCGCTTGA TAAGTTTCTT CTTTAACATT TACTCCACCA TTAATAACTC GTGAAACTGT 9360 

TTTTGGAGAA AAACCTGATA AACGTGCAAT ATCATAAATA GTTACCTTTT TCCCATTTAT 9420 

ATTTTTCATT TCAGTCCTCC ATTACGAACA TTCTAATATT ACTATACAAT ATTTAATTTT 9480 

TTTTAACAAG AGAATTTAGT AAATTATTTA AGATCCACAA ATTCACAAAA TTAATTTTAC 9540 

AAATATTCTT CCCCTTCAAA AAAGTTTAAA TTGCATTTCA CACCTTTATT TTTAAGAATG 9600 

TTTCCAACTT CACGACAAAT AAATTCATAT GAGAAAAAAC TGCCATAAAA TTGTAGATTA 9660 

ACTTTTTCAG TAAAATGTGT AGGATTTATA AAAACATATA ATAGCCTGTC AATGTAACAT 9720 

TTTAACATAG AGTTAATTTT TTCTTTAAAG ATAACATTTG TTATCAACTC ATCAGGAGGT 9780 

AAATGAAAGG CAAACACCAT TTCACAAATA TCATAAAAAG AAATAAATTT GTATACTTGT 9840 

ATCAAACAAT TATTATCAAA ATATTCTATT TTACCTAAAT CAAAATTGAT TTTATAATCT 9900 

TTCATAAAAA CCTCTGAGCA AAAATCTACT CAAAAATTAG ATGATTAAAA CATCTAAAAA 9960 

GCAAAAGGAC AAAAACATCT GTCCCTTTGT TTACTAAATT TCAGCTAATT TCTTCGACAT 10020 

AAATAACACC TACAATATTA GCAATTTCTT CCATCAGTCG AAGATGTTCA AATCTACCTG 10080 

ATAATTCCAG AGTAATAAAT GACGCTATTT TTTTGTCCGG AACATCAAAG TATTCAATTC 10140 
TGTCAGAATT AACATCTCCA AACGCTGTTC TTGAATCGGT CATTCTGATA CCATTTTCTG 10200

CACAATAAAC CAATACACGA TTATAGGCTT CTGTAGATTT AACCACTATA TACAATTCAA 10260 

TCATTTTAGA ACGATTTTGC AGATATTTTT TTAGTGGTTG GAACATGGAT ATCACACCCC 10320 

AAACAGAAAT GGCTACTAAA AGAGCTCCCT CATAAGG 10357 



(2) INFORMATION FOR SEQ ID NO: 192: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6867 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 192: 



CGGGACATTC TCAATCTTCT GTCTTTTGTT TTTCTCTTCT TTCTATGATA CAATGGAAAA 60

AATAAATTCA AAAGGAGTTT TTTTATGACT TATCCAAATC TCTTGGACCG CTTCTTAACC 120

TATGTTAAGG TCAACACGCG CTCTGATGAA CACTCTACTA CTACTCCAAG TACACAGAGT 180

CAGGTTGACT TCGCAACAAA TGTCCTAATT CCTGAAATGA AACGTGTTGG ACTGCAAAAT 240

GTTTACTATC TACCGAATGG TTTTGCTATT GGAACCTTGC CAGCCAACGA TCCGTCTTTA 300

ACACGTAAGA TTGGTTTTAT ATCGCACATG GATACTGCTG ATTTTAATGC TGAAGGAGTC 360

AATCCACAGG TAATTGAAAA CTACGATGGT GGTGTGATTG AACTAGGGAA TTCTGGTTTC 420

AAACTCGATC CAGCTGACTT CAAGAGTCTT GAAAAATATC CAGGACAAAC GCTCATCACA 480

ACAGATGGAA CAACCTTGCT AGGTGCTGAT GACAAGTCAG GAATTGCTGA AATTATGACA 540

GCCATTGAAT ATCTAACTGC TCATCCTGAA ATTAAGCACT GTGAGATTCG TGTTGGTTTT 600

GGTCCAGATG AAGAAATCGG TGTTGGTGCC AATAAATTTG ATGCAGAAGA TTTTGATGTG 660

GATTTTGCCT ACACTGTTGA TGGTGGTCCA CTAGGTGAAC TTCAGTACGA GACTTTCTCA 720

GCCGCTGGTG CTGAATTGCA TTTCCAAGGT CGTAATGTCC ACCCTGGTAC TGCCAAAGGG 780

CAGATGGTCA ATGCCCTTCA GCTAGCAATT GATTTTCATA ATCAACTTCC AGAAAATGAC 840

CGACCTGAGT TAACTGAAGG TTACCAAGGT TTTTACCATC TAATGGATGT GACAGGTAGT 900

GTTGAGGAGG CGCGTGCAAG CTACATCATT CGTGATTTTG AAAAAGATGC CTTTGAAGCG 960

CGTAAAGCAT CCATGCAATC TATCGCTGAT AAGATGAATG AAGAACTTGG GAGCGACCGT 1020

GTCACTCTCA ACTTGACAGA CCAGTACTAC AATATGAAAG AAGTCATTGA AAAAGATATG 1080

ACTCCAATTA CCATTGCTAA AGCCGTTATG GAAGATCTAG GTATCACGCC TATTATCGAA 1140

CCAATCCGGG GTGGAACAGA CGGCTCTAAG ATTTCCTTTA TGGGAATCCC AACTCCGAAT 1200

ATCTTTGCAG GTGGCGAAAA TATGCACGGA CGTTTTGAAT ACGTTAGCCT TCAGACTATG 1260 

GAACGTGCAG TTGATACCAT CATTGGCATC GTAGCTTATA AAGGCTAAAA AGACGAGGTA 1320 

GCTCAGCTAC TTCGCCTTTC TTTTTATTCT ACTGGTTTTT CTTGATTTCC AGTAGTTGTA 1380 

GAAGATTCTG TTGTTTCATT TTCTGAAGTT GATTCAGCAG GTTTAGAATC TCTTGTATTG 1440 

CTTGGTTTGT TTTCGTCGCT AGCAGTTTCA ATGTTAGATT CTGCAGTTGC GTTTGGTTGG 1500 

TTCTCAGCAC TGGTGTTATC ACCATTTGCT TCAGCATTTC TTGCTGGACT TGTTTCTTCA 1560 

CTTGCGCTAG CTTTTGACTG GATTTGATGA TTCAAAACTA GAATAGCTTT TGTCGATTCA 1620 

AGTAAAGCTG TTTTGTCTTT ACTCTTAGCA GAAAGTTGAT CTAATAATGC ATCCACCTTA 1680 

TCAAAGTCCG CATCAGATCC ATTATTACTT TCTAAATAAG AGTGAAGCGA CATGAGAATA 1740 

TCGTAGAGTT TTTGATAGAG TACAAGTGTC TGAGGATCTT GCTCAGCATT TTCCTTTTCT 1800 

TGTTGAAGGG CGCTAGCGAT ACGAGTCAAG ACATCTTTTA CCTGACTGTT TACTTCATCC 1860 

AAGTCTGCAT CAGCCTTGTT TGTGGCAGCT TTTAGATTTT CTACTTCTTC TGCCAAGGAT 1920 

TGTCTGATTC CTTCTTCATG GATTTGTTCC AAGAGTTGAT TTGCCTTGCT CAAAAGACTT 1980 

TCTACTTCTT CCTTGCTATC TGTCGCAGAT TATTGGTTGC TATCTACCAT GTACTCCTAA 2040 

AACAGGAGAG TTATAATCCA AGATTACAAG GCCTTACAGA AATAAGAAAT CCAGATAAGA 2100 

CAATGTTCGT CCAAGACGCT ATTCGCTTCG CACAGCAGCA CGGATTCAAT ATGCTTTAAT 2160 

TTTAAAGTTT AGGTGTCAAG ACCTCTTTTT AGTGTGCCCA AAATTTAGAG AAGTAATCAA 2220 

TCAACTAACT TTTATTTTTT TCAAACTTTC AGTAAACTGA CCTAAAGCTA ACTCAATCTG 2280 

TCTTTGTAGA TGCTTCTGCT ATCAGCTAGA AGTTGATCTA CTTTTGCCAA GACTGCCTTC 2340 

TCATCAAAAG TTCCAGGTTG ATAGTTGGAT TGCAGGGATG GAATCTTGTT TTTCAAAGCC 2400 

GCTTCATATC CCTTAGTTTG AACCTTGATG TAGTGATTGT GGTCGCCATG AGGAATCACA 2460

AAACCTTCTG AATCTTCACT TATAATTCGA TTGGCATCAA AACCATGACC ATCTTCTTCC 2520 

TCATGATGGA CATGTAGTGA CGGATTACTT AATACAGAAC TAGAAGAACT TCCTACCTCT 2580 

TCCGTGTTAG AGTGTGATGG GGGATTGTTA AGAGATGACT TAGGAATATA GTGATAGTGA 2640 

TCCCCATGTC TTACTATATA AGCATCACCT GTATCTCTGA CAATATCATT AGGGTTAAAG 2700 

ACATATGTGG CTGCTAATTC ACCTGCCGAC AAGTCACTCT CAGGAATGAA ATGATAGTGA 2760

CCACCATGTG GTACTATAGT AGATTGAAAT AGAATATGAG CAAATTGATA AGGGGATTTT 2820 

AAAGTAATTT CTAACAATGA TTTAGAAACT ATGATGTGCT ATTCTAAATT CAACTCACTA 2880 

TATATAACCA TCATCGGTAG TATAACGTCC CTGTAATTTT GCTACAGATA CTTCTGCACT 2940

AGCTCCTTTA TCGTCTTTAC CATGTTCTTG TTTTTGGCGA TTGATTTCAT CTTTTGTTCG 3000

TACATTTTCT GCATGAGCTT GATCTTTAAG GTAAACATAA TACTTTCCAT CTACCTTAAT 3060

AATATATCCT CCCTTAACCT AACTGACGAT ATCTTGATCT TTCGGCTGAT AGTTGGGGGC 3120

TTTCATTAAT AGCTCTTCAC TAAAGAGCGC ATCAAAAGGA ACTTTACCAT TATAGTAGTG 3180

ATAATGATCG CCATGAGAAG TTACATAACC TTGATCTGTA ATCTTAATAA CAATTTGTTT 3240

TGCTTGAATT CCTTCTTTTT GACTAACCTA GTCTGGAGTC AAATTTTCAG TCTTCTTAGT 3300

GTCTTTATTA CTGTTTACAT ATGAAACACG ATTTTTATCT GTATTGGCCT GTTAGCTATG 3360

TTGGTTCAGA GCATAAACAC ACAGACTTAA GGAAAGGATA ACAACAGATC CAGCTGCTAT 3420

ATATTTCTTT TTAAATTTCA TAATTACCTC ATTTCTATAA TTATTTATAT GATGTCTTCA 3480

TTATTAAATG ATTAAATAAA TTAATTAACC AATTAATTAA CTAGTAAATA TTCCACCTCT 3540

TTTTAAGTTG TATGTCAAGA AATTTTATAT ATTAATAATA AAATGAAATT CTCCCAAAGT 3600

CAGAGTTTTA TTTCTAACTT TTGAGAGAAC TTCATTTTTG ATTCAGACTT TTTCTACTGC 3660

TATTCCTTAC GCTATGAGAT CAGATAAATT CTTTTTTATC ACTTCTCCAC TTGGCAATCT 3720

TAATTCAATC GTTCCATCCA TATTGAATAT AACACTATCT AAGCCTAATC CGTAACTAGC 3780

TGTAAATTTT TCTAATTTTT CTTGTACAGG ATCTACTGCT GGAGCTTCCT CTAATGCTGG 3840

ATCTAACATA GGGTCACTCC CCACATTCCC TTCTGGATTC AACATTCCAT TATCCGTTGA 3900

GTTTTCTGGT TTTACAGGTT TTTCGTTTGG TGCCTCTGGT AAAGAATCTG CTGGTTTATT 3960

TTCTGTTGGT TGGTTCTCAA CTGTTCCAGT AGATACTTTT CCATTTTCAG ATGGTTTATT 4020

TTCACCATTT CCTTGAGGTG CTTCTCCTGT AAAATCTGCC ATATTCTTTT TAATGACTTC 4080

TCCCGATGGT AAATATAATT CAATTGTTCC GTCCATATTA AACAAGACAT TTTCTAGCTT 4140

CATCCCATAA CTTTCAGCAA ATTTTGCTAC TTTTTCTTGT ACAGGATCCA CTGTAGGAAC 4200

TTCTTCTAAC GTTGAATTAC TAGTACTATT CCCAGTTTCA GAAAGTTTTT CTTTTTCTAC 4260

CTTCTCACTA GTCTTTGGTT CTTCTACCTT TTCATCAAGT TTTAAGTTTT CTTGTGCTTT 4320

ATTCCTTTTA AATTGTGGTA GAATACTTGG TTTATCAGTT TGATTTTCTT TTTCCAAGAT 4380

AGGTACTTCC ACAATATAAG TCGATTGATT GTCCAAATAA GCATTTGCCA TGAAGGTTAC 4440

AGGAATTTTA TTTCCGGCCG TTCTGGTTGT TCCTTGGTTT AATTTCGGAA TCGGTAATTT 4500

GATTTCACCA ACTTTATAGT TATTTTCTAA ATAAGCATTT CCATGAAATT CATCAAACAC 4560

TCTGACTAAA GCATCAGTTC CTTTAGGCAC TGCAAATTGA GGGTTCACTC TTAAATAAGT 4620


ATCCCCTGCA TGGAAAGGAT AGAAAATCGT TTGACTGGCC ATTTTGTAAG CTAAAGAGGT 4680

TGGAACTGTA AATGTACCAT CATAACTTAC TTCTGGATAA TCTTTTGAAG CGATAGTATA 4740

CTTAAATGTT TGTCCTGGTA AATAAGGTTG ATCTAATTCA AAGTTTGCAA TATTCCCTAC 4800

TCCTTCTCCA AATACTTTAC CAGATACTTT CTCCAATACT TTTCCATCTG GTGTTATTAA 4860

TTTTACTAGC ATATTGATAC CTAATTTTTT CTCCAATTCA GGCGGAAAAC TAAAAGAAAC 4920

GCGTTTTTGA CCATTGGCTA GAGTAAAGTT TTGATTATTA AACGTACTAT TTTTTAACAA 4980

ATTAACAACA TTCGTTAATT CTTCTCCAGT ATAAACTTTA TTCCCTTCTT TTTTAGCAAC 5040

TCCTTCTTCG GGTTTAAACA GTTCATAGTT ACTGTGAGAA TGACCAATTC CAACCGGTTT 5100

ATGTTCATCA ATCGGATCTG CATGATGGTG ATCTCCATGC GGATAAATAA TCGCATTTTT 5160

TTCTTTATTC ACGACAATAC TTTCACGTTT GACACCATAT TGTTTCATAA TGCCAGCAAT 5220

TTTTTCTTCG ATTTTTTTAT CTAAATCTTT CATTTCTTTG GCATTACTTG GATAATCCTG 5280

TTCATGAGAT GACAAAGAAT CTAATCCATT ATGACTAGTT TTAACTTCCT CTAAATGTTT 5340

TTGCGCAsCT TAATTTGCTC TTCTGTCAAG TCCTTCTTGA AGAAATAATG ATTGTGGTCT 5400

CCGTGACTCA TGACAAAACC TGATTCATCT TCAGCGATAA TACGATTAGC ATCAAATCCG 5460

TATCCATCTT CTTCATGTTT CTCATGTGAA GTTCCTGGAT TGATTGGAAG AGATGGAGAA 5520

GGTGTTGCTA GACTATTGTT TGGAAGAGTC GGTTGCCCAA TTTGATTTGA TTTTGGAATG 5580

TAATGGAAAT GATCACCATG TCTTACAATA TAAGCTGTAG CCGTTTCTTC AACGATATCT 5640

TTTGGATTAA AAATATAACC ATCAGATGCT GAAGAGAGCT CCTTACTTGT CGTTAAAGAA 5700

GAAGGATTGC TTGAAAGACT GCCTAGACTA GACACTACTT CATTAGGTTT TGCATTTGTA 5760

GAAACTGTAG AACCAGTTCC ACTGATAGGC ACCATTCTGG CAATCTTTTC TTCTAAGGCA 5820

GAAAGCTTGC TGTAAGGAAT AAAGTGGTAA TGGTCGCCAT GCGGAATCGC AACTCCATTT 5880

GGTGTACGAC TGATAATCTT AGCAGGGTCA AAGACCAGGC CATCTGATTC ACTGTAACGT 5940

TGGGCGCTAG GTGAATCATA GAGTTCCTTC AAAAGACTCT GGAGATTTTC AGATTTATTT 6000

GCTGGCTTGC TAGTTGATCC TTTTGCTACA GATTGCGTGT TATTGTCACT AGCTGTTGAA 6060

GAATAGCTTA ACTGACTCGG TTGCATATTT TTTCCAGCCA GATGTGCTTT AGCTGCTGCT 6120

AATTCACTAG CAGATAAATC GCTTTTGGGA ATGTAGTGAT AGTGACCTCC ATGAGGAACG 6180

ATATAAGCAT TACCCGTATC TTCGATAATA TCAGCTGGAT TAAAGACATA ACCATCATTT 6240

GTCGTATATC GTCCCTGAGA CCTTGCTACA GCAACATTAG AGTTAACCTT CTCATTATCT 6300

TTGACATGTT CTTGTTTTTG ACGATTGATT TCATCTTTAG TTCGAACATT ATCAGCATGA 6360

GCTGCATCTT TCAGGTAGAC ATAATATTTT CCATCGACCT TGATGATATA ACCACCCTTG 6420

ACTTCATTGA CAATATCAGC GTCTTTAAGT TGATAGTTTG GATCCTTCAT CAAGAGTTCT 6480

TCACTAAAGA GGGCATCATA AGGAACTTTC CCATTATAGT AATGATAGTG GTCACCGTGT 6540

GACGTTACAT AGCCCTGATC TGTAATTTTG ATTACAATTT GCTCAGCCTG AATTCCTTCT 6600 

TTCTGGCTAA CCTGGTCTGG TGTCAAGTTT TCACTTTTCT GACTTGACTG GCTGCCATCC 6660 

ACATAAGAGA CACGATTATT GTCCTTATTT TCCTGCGAAC GATGCTGGTT TAGTGCATAG 6720 

GCACATAGAC TCAAGGATAC GATAACAGCT GATCCAGCTG CTATATATTT TTTACTAAAT 6780

TTCATAAATC CCTCATTTCA ATAAATGATG AAGTTTTTTC TCAACTTCTT TTACTTTATT 6840

AAATAGTTTT CTAAACCCGG GGGTACC 6867 



(2) INFORMATION FOR SEQ ID NO: 193: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 999 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 193: 

CGTTCTAAAA ATGCAGTACG TTTGATTGAG AAATCAGTTA AAGGTATGCT TCCACACAAT 60 

ACACTTGGAC GCGCTCAAGG TATGAAGTTG AAAGTATTTG TTGGAGCTGA GCACACTCAC 120 

GCTGCACAAC AACCAGAAGT TCTTGACATT TCAGGACTTA TCTAAGGAAA GGAACAATAA 180 

AGTATGTCAC AAGCACAATA TGCAGGTACT GGACGTCGTA AAAACGCTGT TGCACGCGTT 240 

CGCCTTGTTC CAGGAACTGG TAAAATCACT GTTAACAAAA AAGATGTTGA AGAGTACATC 300 

CCACACGCTG ACCTTCGTCT TGTCATCAAC CAACCATTCG CAGTTACTTC AACTGTAGGT 360 

TCATACGACG TTTTCGTTAA CGTTATAGGT GGTGGATACG CTGGTCAATC AGGAGCTATC 420 

CGTCACGGTA TCGCTCGTGC CCTTCTTCAA GTAGACCCAG ACTTCCGCGA TTCATTGAAA 480 

CGCGCAGGAC TTCTTACACG TGACTCACGT AAAGTTGAAC GTAAGAAACC AGGTCTTAAG 540 

AAAGCTCGTA AAGCATCACA ATTTAGTAAA CGTTAATTCG AAAGAATTAC TATACTTATA 600 

CAGAGCACCT TTCGGGGTGT TCTTTTTTTA TACTTTCTTA CTAAATTGGT GCAATTGACA 660 

CAGTTGTTGC GACTTTAGTC GCTTACAAAT GTGGCTGCAA CCTGACATGG TCAGTTGCCT 720 

CAAAACGTTA ATCAATACGA TTATATCAAC GTTTCAAAGC ACTCAAGGGT TTACCCTATG 780 

GGTGCTTTTT TCTATACTTT CTAAAAAAGT TTACCCTAAA ATTTGCCCTA AAATTACCCT 840 

ACTTATTTTT AAGATGTTGG TAGGCAACTT GTCCAGCAGA TAATGGAACT ATGTTTGAAG 900 

TATTAACATA AGTCTTAGTT GTAACGGTAT CGCTATGAGT TAATGCTTCA GAAATGGCTT 960 



WO 98/18931 



PCT/US97/19588 



CTAAGCTCAT TCCTGCTTTT TTAGCAAGTG TCGCTCCTG 999

(2) INFORMATION FOR SEQ ID NO: 194: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2315 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 194: 

AATATTATCA CTGTTCTTGA AGGCAGAACA CAAGCTGTCA TCCGAAATCA CTTTCTTCGC 60 

TACGATAGAG CCGTTCGTTG TCAAGTGAAA ATCATTACGA TGGATATGTT TAGTCCTTAC 120 

TATGACTTGG CTAAACAGCT TTTTCCGTGT GCTAAAATCG TTCTAGATCG TTTCCATATT 180 

ATCCAACATC TCAGCCGTGC CATGAGTCGT TTTCGTGTTC AAATTATGAA TCAGTTTGAA 240 

CGAAAATCTC ATGAATACAA GGCTATCAAG CGTTACTGGA AACTCATCCA ACAGGATAGT 300 

CGTAAACTCA GCGATAAACG TTTTTATCGC CCTACTTTTC GCATGCACTT AACAAATAAA 360 

GAAATTCTTG ACAAGATTTT AAGCTATTCA GAAGACTTGA AACACCACTA TCAGATCTAT 420 

CAACTCTTAC TTTTTCACTT TCAGAACAAA GACCCTGAGA AATTTTTCGG ACTCATTGAG 480 

GACAATCTGA AGCAGGTTCA TCCTCTTTTT CAGACTGTCT TTAAAACCTT TCTAAAGAAC 540 

AAAGAGAAAA TCGTCAACGC CCTTCAACTA CCCTATTCAA ACGCCAAATT GGAAGCGACC 600 

AATAATCTCA TCAAACTTAT CAAACGCAAT GCCTTTGGTT TTCGAAACTT TGAAAACTTC 660 

AAAAAACGGA TTTTTATCGC TCTGAACATC AAAAAAGAAA GGACGAAATT TGTCCTTTCT 720 

CAAGCTTAGC TTTTCTTCAA CCCACTACAG TTGACAAAGA GCCTATTTTC GCTGATTCTC 780 

CACTACATTT GACTGGATTC TAATTTTTTA GAGAAATACA AAAGAGCTAG CTTTAGCTAG 840 

CTCTTTTCCT ATGCGGAGAG AGGGACTTGA ACCCTCACGA CCTAAAGCGG TCACAGGATC 900 

CTTAGTCCTG CGCGTCTGCC AATTCCGCCA TCCCCGCGTC GATTACTTTA CTAGTATATC 960 

AACTTTTGGG ATGCTTGTCA ACACTTTTTT TCAAATTTTT TCATTTTCAC CAACCAGGTT 1020 

ACTCAAAAAG TTCATTTAGA TTTTCATCTA CTAACTTAGC TCCGAGTGTA TTTTTGAAAT 1080 

GACCTAGGGC AAATTGATGA TTTTCAGGCC AGATGGAAGC AACAGCTGGT TTAACAATCT 1140 

CGATGTCATA TCCTAGATTA TAGGCATCTA TAGCTGTATG TAGGACACAG ATATCCGTCA 1200 

AGACACCTGT TAAGATAACG GTAGACACTC TACGCTCTCT CAAACGAATA TCTAGGTCAG 1260 

TCCCTGAAAA AGCTGAGTAA TGGCGTTTAT CCATCCAAAA GACACGACTG TCTGAACCAT 1320 

GCTCTTGATA AAAGATCCCC AAATCTCCAT ATAAATTCCG TCCACTCGTC CCAATCAGAT 1380

TATGAGGAGG AAATAACTTA CTTTCCGGAT GGAAACAATC GTTTTCTTCA TGAGCATCAA 1440

TAGTAAAGAA GATATAATCT CCTCGTTCAA AAGCTAATCG AGTTACCTTG CTGATGGCAT 1500

CCGAAATCGC CTGAGCTGGA GCACCTGCTG TTAGTTTCCC ACTATCAGCA ACAAAATCTT 1560

CTGTATAATC AATCGAAATT AAAGCCTTTG TCATTAGTAA TCTCTTTTCT TCACTTCTTC 1620

AAAAATATCT GAAATCAAGA CCTTAAGATA GGTTCCCTTC ATTCCAAGTG AGCGACTTTC 1680

AATAATCCCC GCAGACTCAA GTTTACGAAG AGCATTGACA ATCACAGAGC GAGTGATTCC 1740

GATACGATCT GCAATCACTG ACGCAGTCAA CTTCCCTTCA TTTCCATTTA 1800

AATTGCTGAA ACAGCACGGA GTTCGGAGTA AGAAAGGGTA TTGACCGCCA TGGTGACAGC 1860

AGTACGACGA CGAATATTTT TCTCATCTTC TTCACGTTGG AAGTTAAGAA GCTGAATCCC 1920

AACAACGGTA CTGGCAATCT CAACAAGAAC CAAGTCCTCA TCTTCGAATT TTTTATCATT 1980

ACGCCAAATA ATCAAAGAAC CAAGGCGAAT CCCCGATACA TGAATCGGTG CAATAGTCGT 2040

CAAGCCATCT GGAAAATCAT CTCTACTCTC AATAGGGAAA ATACTCATAT CATGCTCAAC 2100

AGGCAAGTTT GCTTCTGTTT CGTAAATCAT ATTAGCCCCT TGAACGTAGT CATCTGGGAA 2160

AATCTTAGTT TGGAAGAATT GCTtACGCGA TCTGTATTTG TTTTATAACG CATAAAATAG 2220

CCAAGCAGAC GTCCCTTACT ATTGATAATG CAGGCATTGC AATGAATAAT ATCCGCTAAC 2280

TGACGCGTAA TAGCGTTGTA AGGGAGCTCA TCTCG 2315



(2) INFORMATION FOR SEQ ID NO: 195: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6693 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 195: 

CGATTTCTTC CATTTCTTCA AATAAGAATA CTTCATCTGA CATATGTGTT ACCTTCTTCA 60 

TCAAAAATTA TTTTGTAATC GATTACATTG CAGATCGTAA CATAAAGAAA AACAGATGTC 120 

AAATATTAAA CGTAAAAACA TGGTCACTAA AGAACTATAA GAGAAAAGGT AAACCTAGCG 180 

ACGCGATGAA CGCTGGGTCG TTTGGTTTCG ATTGCTCTCT TCCTCTTGTT TTTTCTGTTC 240 

TTCTTCTTGT TTTTTCTCAG CTTCCTTGGC CTCTTGTTTG GCTTTTTCCT CAGCTTCCAT 300 

AATTAATTTA TCCGCCACAG TGTAGCTGTA GATTCCAGCT TCCATGTCGA CCACACTCGG 360 

TTCTGACAAT TGAGGCTTAA TCTTACTGTA ATATGGCAGT TTCTTACTCA TTTCAGATAG 420

AGGAACCAAG ACTTCGTCCG AATCATTCAT GGTCAATCGA ATTAAATCGG ATGTCACCTT 480

GCTTGGGGCT AATTCCACCT TTTGGATAGC CGCCTTGAGT TCTGGGCTAA TTTGAGCAAG 540

TTCTGAGACA AAAACTTTGA TTTGTTCACT ATCATTAAAG AGAACTGATA AATAAGTTTC 600

TGGTAAACTG TTCAGACTCA CAGAACTAGT CTCAAGCTGA CCACTGGAAA GAATAGGATA 660

ATGATTTTCA CCAGAAATAT AGTAGGCCAC AATATCATAT TCCTTGACCT TAATAGTGAA 720

CTTAGTTGGA AATTGATAGA CAAGTTGAGC TGATTCAACC CAATAGTTAG ACTTAATCTG 780

CTTTTCATAT TTTGCCTTGT CTAGCAGAAG GTTAATCGTA TAATCCGAAT CCTGAATGCC 840

TGAAGCCTGT CGAATATCAT CAGCTGTAGT TTGCACCGTT CCCTCAACAC GAATATCTTT 900

CATGGTCGCA TAAGGACTGA GCAAGTAGGC AGAGACAAAC AATAGAAGCA GACTTGGAAA 960

TAAAATCGTG AAGGCTCGCA AGATATGGAT ACCAGGAATC TTTGCTTTGG CTGGTTTTTC 1020

CTTTGTAGCC TTTTTAGCAA GCTTTTTATC CTGTTCCTCC TTCTCTTTAG ACTCTGGTTC 1080

TTCTTTCTCT TCTTTCTCTT TGTCAGCCTC TGAGGATGCT ACTTTTTCTT CAGACTCTTC 1140

CTTAGCTGAT TCTGAATCTT CCTGGTCTGT TTCACTCTCC TGGTCCTGTT TATCCTCTGA 1200

CTTCTCAGAT TCTTCTCCCA TTCGAGCTTG TCTTTCCTTT TCCTTCTCCT CAGCTAGAGC 1260

CGCCTCTTCT TCAGCCTTCT TTTTTAGATA TTCTTGGTTT CGTTTCTGCC ATTCTGATAA 1320

CTCTTTCAAT TCTTCGAGGG TTTCTTTGTC CTCATTTTTC TTATCTTTTG ACATTTACTT 1380

TCCTTATGAT AAATCTTTTT TCAACAATTG ATAAAAATCT GCTAGAGATT TCAATTCCTT 1440

AGAAGCTTTC ATCTTAGCTT GGTAATCTTC CTTGTGACTT AGTAAGTGAG AAAGCTTCTC 1500

TTCCAAACTA TCCAAGGTCA AATCGCTTTC TTGAAGGTCT TCTGCATAGC CTTTCTTAAC 1560

AAAGTAAGCT GCATTTTCAA TCTGGTCACC ACGACTAGCT TCACGACCAA GCGGCACAAT 1620

GACATGCAAT TTTGCTATCG CCAAGAGCTC AAAAATCGTA TTGGCACCAC CTCGTGTCAC 1680

AACAATATCA GCCAATTCCA TCAAGGGTTG ATAGAGATCG GTCACATAGT CAACACGAAA 1740

AAGATTTTGC CTCAACTCAT TCAGACTAGA ATCTCCAGTT AGATTGATAA TATTGTAGCG 1800

CTCTGTTAGT TCTTTCTTAT GGTCTGTCAC CAATTGGTTA AAGACACGAG CGCCTGCAGA 1860

ACCGCCAACA AACAATACAG TTGGCAATTT GGGATTAAAG TGGGTTTGAA TATCCACCAA 1920

TTCATCTGGT TCTGGAGTGT TTTTGTCCGA AACCTTGGTC ACCGCTCCCA CATGCTCAAC 1980

CTTAGCCAAA CTCGAAGCTT GTTCAAAGGT TGAATACATC TTAGTCGCAA ATTTATAGGC 2040

GATTTTATTG GCCAAGCCCA TAGACAGGTC AGATTCGTGA ATAAAGACAG GCACTCCTGA 2100

CACACGCGCA GCGATAACAG GCGGTACTGA GACAAAGCCC CCCTTTGAAA AAAGGGTCTG 2160


TGGACGCAGT CGCAACATGA TAAAGAGCGA TTGGACAATT CCCCAACCAA CTTTGAAGAC 2220

GTCCAGCATA TTTTGCCAAG AGAAATAGCG ACGCAATTTT CCAGTCGCAA TAGAATGGAA 2280

GGTGACATCC AAACCTGACT TAAGGATTTC TTGGTGTTCG ATACCACACT TGTCCCCGAT 2340

ATAGTGGACT TCCCAACCAT CTTCGATGAA CTTGGGCATT AACAAAAGAT TGAGGGTAAC 2400

GTGTCCAACC GTCCCCCCAC CTGTAAAGAC AATTTTTTTC ATATTATTCT TTTAACTCCG 2460

CTACTGTGTC GATAAAGAGG TCGCCACGTA CTTCAAAGTT AGCATACATA TCCCAGCTAG 2520

CATTGGCAGG ACTAAGAAGA ACCACATCTC CTTGAGTCGC AAGCTCATAG GCCTTGCGGG 2580

TCGCATCTGC AATATCTGTC GCCTCCACAT AAGCGACACC AGCCTTGTCT GCTGCCCGTT 2640

TGACACGTTC TGCAGATTGA CCCAGGATGA CCATCTTCTT GAGTCCAGTA ATGTCTGGCA 2700

CCAATTCGTC AAACTCATTG CCACGGTCCA AACCACCTGC AATCAAGACG ACCTTGCTGT 2760

TGTCAAATCC TGACAAGGCT TTTTGAGTAG CCAAGATATT AGTTGATTTA CTGTCGTTAT 2820

AGAATTTAAC ACCCTTGATG TCATCCACAA ACTGGAGACG GTGTTTGACA CCACCGAAGG 2880

CTGAAAGAGT TTCCTTGATG GTTTGATTGT CCACATCACG AAGCTTGGCT ACAGCAATAG 2940

TCGCAAGGGC ATTTTCCACA TTGTGGCTAC CTGGAACACC GATTTCATTC GCTGCCATGA 3000

CTACTTCACC ACGGAAGTAG AGTTGACCAT CTTCCAGATA AGCTCCATCA ACCTTTTCAA 3060

GTGTTGAAAA TGGTACAACA GTGGCTTCTG TCTTGGAAGT CAAGTCTTTT GCCAAGTCTT 3120

GATTAAAGTT CAAGACAAGG AAATCAGCTG CTGTCATCTT GTTCTGGATA TTCCACTTGG 3180

CTGCTACATA TTCCGAAAAT GACCCATGGT AGTCGATATG AGTTGGCATG AGGTTGGTAA 3240

TAACCGCAAT CTCTGGATGG AATTCTTGAA CACCCATGAG TTGGAAAGAA GAAAGTTCCA 3300

TAACAAGCGT GTCCTTATCT GATGCTATTT GAGCAACCTG ACTAGCTGGA TAGCCGATAT 3360

TCCCTGATAA AAGACCATGT TGGCCAGCAG CAGTCAAAAC TTCCCCAATC ATAGTCGTTG 3420

TGGTTGTCTT ACCGTTCGAT CCTGTGATAC CAATAATCGG TGCTTCTGAA ATCAAATAAG 3480

CCAATTCCAC CTCAGTCAAG ACTGGAATTC CCTTGGCCAA AGCCTTTTCA ATCATGGGAT 3540

TGTTGTAGGG GATACCTGGA TTTTTCACCA TAAGGGCAAA CTCTTCATCC AAGAGTTCCA 3600

AAGGATGGCC ACCTGTAATG ACCTTGATCC CTTCTTCCAG CAAACTTTGG GCAGCTGGAT 3660

TGTCCTCGAA AGGTTTCCCA TCATTTACTG TCACAATGGC ACCTAGCTTG TCCAACAAAC 3720

GAGCTGCAGA TTCACCAGAC TTGGCCAAAC CTAAAACAAG GACTTTCTTA TTTTTAAATT 3780

GATCTATTAC TTTCATGTCT CGAACTCCAT TTCTACTCCT ACTATTTTAC CATTTTTATG 3840

GAAATAAAAA AGCCACAAAG TGTGTTTGTG ACTCTTTCTT CTAACTGAAT CTTACCATAT 3900


CATCTATGTG ATAAATCGGT AACTCGAATG ACCTGATCCA CTTGCTCCCA AATCAGAGGA 3960

iinivwulLu \*J\n L f\J\ I t\J\ i GGTCCGATTC GGATTTTTTA AAGATTCTAG GATGGAAAGT 4020

nnx i i Lrtb Abl 111 rv»GG GTCTAAGGAA GCGGTTGGTT CATCTGCGAG GATCAAAGGT 4080

Vj^jA i 1 1 1 A AAATTATCTT CGCTAGTGCA ACACGTTGTG CTTCTCCTCC TGATAACTCA 4140


AATATAGGTT GCTTCAAATC CAAATAAGAG AGGTTTACAC GGTTTAGAGC TTGTTTCATC 4200

AAAGAGATTT TCTCTTTTTC CTTCAACTTT TTACCAACTA AACCCAGATT GAGATTCTCT 4260

TTGACGGTTT GGCTTTCAAT TAAGCCAAAA TCTTGAAATA AGTATCCTAA GTAATCTCTA 4320

AAGAAAACAG AAGGCTTGAT GTCCTTAAGA GAAGTGCCAT CATAGATGAT TTGCCCTTTG 4380

TCATATGGCT CCAATCGTCC AATCATATTC AAGAGTGTTG TCTTACCACA GCCACTTGTA 4440

CCGATTAAGG CATAAATTTT CCCACCTTCA AAATGAAGAT TCATATCTGA AAATAGCTGA 4500

CGGCTTCCAA ATTTTTTAGA TATATTCTTT AGTTCAATCA TCCTATTTTC CTTTCATAAT 4560

TGTCATAGAA ACACGAGATT CTTTCTGCGC TTGACGGTAA AGCGTCAAAA CTGCACTAGC 4620

TAGAAAGACC AATAAAGTGA GCAAGCCAAT CACCAAGTCT CGACTGCTTA AAATAAAGAG 4680

ACTAGCACCA AATACAAAAC TAGCAAATTG GCTAACCATA TACTGAGCAT GTGTTTCAAA 4740

AAATCGTAAA CCTGAAATTC GTTTAATCAA GATATCTCGG CGGAATTGCT CGAAATATAG 4800

AAGATTGACA GAATAAAAGA GTAACAAGGA ACTGGCTATT CCAACAATAG CTCCTAAGAT 4860

TAAAGTTGCT GTTTCAGTTT GAACTTCATT ATAACGAGTT AGATAAACAC TTCTTCCTTC 4920

TTTAAGATAG GATACTTGCT CATAAATTCC AGCTTTCTTC AAGAGTTCTA GCCCACTCTC 4980

ATATCCTTTG ATAAAGAGTT GTTTTCCAGC ATTGATAGAC CAACTAGATA AGGATATAAA 5040

ACTATCACCT GTAGAAGTCG GCGTGAATAC CACTAAAATC GGATCAGTCA AATACTGAGT 5100

AGATACGGGA TTCTCACCGT TATTATAAAC AAACCGCTTT TCTCCCATTG AAAGATAACT 5160

AACGTGCGCT TTCATCTCAT AATCCAAAGG AGCACTTGCC TCCTCACCAG ATTTTCCATA 5220

ATAACTCAAT CTTTCTTCAA AAACTTTCTT AAGTTCTGCT TCTCGAGAGC GCAAATGTTC 5280

TGGGAGCAAG AGGATAAACT CACCTTTTTG GAGATGGGCT AACTTCTGTT TGGTCTCAGC 5340

ATCTACCACG ACCTTTTCCT TGTCCAAATA ACTGGGACTA ACATAGAGCG TATTAGCATC 5400

TGAACTATAG GTATCCAGTG TCTCTCCCTG TTCATTTTTT CCTTGTGGAT TGGCAAAATG 5460

GAGCAGATTA TCCTTTACAT AAAGAGCTTG TTCTTCTTCG ATTGCTTCCT TGGCAAAGGC 5520

ATACCACTTG CTCTGATTTT CTGTATCTTT TCCTCTATCA CCTAAGCCAA AGGAAATCTG 5580

GTAATAGTCT GCTCTGTCCT GCCATGCTTG TTTTGAAATT TCAAGTTCTT TCAATCGTTG 5640

GTAAGACGTC AAACCTGTCT TAACAGCGTA GCCTACTGTA AAAACAGCTA CTAACTGACA 5700


CAATAGGGTT AAAGCCATCA AGCGTTTAAG GGGTAATCTT CCCTTAATAA CGGGAACTAA 5760

TGCTTTGTAA CTCAAACTCA TTAGGTAAAG GAGCATTAGT AAAATTGAAA TCGCCAATAA 5820

AAACAACAGA TAGAAACTAA TCCCAAAACC ATAGGTGGCT AACAAGATAG GATAAAACAA 5880

ACCTTGACTA AAAAGAACGA CTCCCCCACC TAGGAAGGAA AGGAGGGCTG ATAGAAGGAG 5940

CCATTTGATA TCAGTAGATA AAGAATGCCC CATGATGGAT AAGAGAGTCT GACCAGAAAA 6000

GAGTTTTATA CCTGCTGCTC TCATTTCCTT AATCCGAGTG ATAATCACTA AAGCAAAGAA 6060

AGATAAGCCA AATATTGCTA AACTAATTAA AATAAGGGGA TTTAGTAATA TTPG A AAA fir* 6120

AAGAAAATAG GGCGGTATCT TTCGGTCAGC ACTTGCTTTA TAACCCAAAT 6180

ATCGGCAAGC TTTTCTTTCG TCAAGGAGCC TGACAAAAGG AGATAAfTAT TTAGCGGAnT 6240

ach<-l»iicac GACTTTCTTG GCTAGCTTCT TGGAATTCTT TTGGTAAAGT TCCCTGACCA 6300

TAAGTTGCAT AAGTAAAGTG AGTCGTCCCA TCCTTACTCG GCTCTACAAT TCTTCTAGCT 6360

ATTAAACTCT GTTCTGAGTT TGCAAAATTC TCCAATTCCT GTTCAAATAC CTCACGCGTC 6420

GGTTCCTGAG TATCTTTTTT GACACGAAGT AAAGAAACGG AATCATAGCT TGCATATAAA 6480

TATTGTGGCG CACGTAAGAC AATAATCCAA GCAAGGAAGA AGCTGAGAAA AAAAGTTGAT 6540

AATAATATGA ATAGTTTCTT CATAGTAGAC TCCTTGTAAA CAAAATTCCC CCTGTAATTT 6600

CTTACAAGGG GAACGATTTA AATCAATGAA CGATTAGTCA TAATCACAGT AAAATGCTAC 6660

TTGTTCTCCC CATTTAGTCC AAATCCATGC AGG 6693

(2) INFORMATION FOR SEQ ID NO: 196:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1847 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 196:

CCGGTCTATG TACCCACTAC TTTGGGACAA TATGGGGATC AGCTACCCAA AACTAATCGA 60

GCGTTTGGTT GACCTTGCCA AGGAAAGTTT TGACAAGCGC GACGATTTGA TATAAAATGA 120

AAGAGAGGGT AGAAGCCAGA ACCATCACTG CACGGTGACT AGAGTTCTCG GACTTCAGCC 180

CTTTTTAAAG GAGTAGAAAT GAAATTAACA ATCCATGAAA TTGCCCAAGT TGTTGGAGCC 240

AAAAATGATA TCAGTATCTT TGAGGACACC CAGTTAGAAA AAGCTGAGTT TGATAGTCGT 300

TTGATTGGAA CTGGAGATTT ATTTGTGCCA CTTAAAGGTG CGCGTGATGG CCATGACTTT 360

ATTGAAACAG CCTTTGAAAA TGGTGCAGCA GTAACCTTGT CTGAGAAAGA GGTCTCAAAT 420

CATCCTTACA TTCTAGTAGA TGATGTTTTG ACAGCCTTTC AATCCTTAGC ATCCTACTAT 480


CTTGAAAAAA CGACTGTTGA TGTCTTTGCT GTTACAGGTT CAAATGGCAA GACAACGACT 540

AAGGATATGT TGGCGCATTT ACTGTCAACA AGATACAAGA CCTACAAAAC ACAAGGCAAT 600

TACAATAATG AGATTGGCCT TCCTTACACA GTTCTTCATA TGCCTGAAGG AACAGAAAAG 660

TTGGTTTTGG AGATGGGACA GGATCACTTG GGCGATATTC ATCTCTTGTC TGAATTGGCT 720

CGTCCAAAAA CAGCCATCGT GACCTTGGTT GGAGAAGCCC ATTTGGCCTT TTTCAAAGAC 780

CGTTCAGAGA TTGCTAAGGG AAAAATGCAA ATTGCAGACG GAATGGCTTC AGGTTCCTTG 840

CTTTTAGCGC CGGCTGACCC TATCGTAGAG GACTATTTGC CAACTGATAA AAAGGTGGTT 900

CGTTTTGGGC AAGGGGCAGA GCTGGAAATT ACTGACTTGG TTGAGCGCAA AGATAGTCTG 960

ACCTTCAAGG CCAATTTCTT AGAGCAAGCC CTTGATTTGC CAGTAACTGG CAAGTACAAT 1020

GCGACAAATG CTATGATTGC ATCCTATGTT GCCTTGCAAG AAGGAGTTTC AGAGGAGCAA 1080

ATTCGTTTGG CCTTCCAAGA TCTTGAATTG ACGCGTAACC GTACCGAGTG GAAGAAAGCA 1140

GCCAATGGAG CAGATATCCT GTCAGATGTT TACAATGCCA ATCCAACTGC TATGAAACTG 1200

ATTTTAGAGA CTTTCTCTGC CATTCCAGCC AATGAAGGTG GCAAGAAAAT TGCAGTGTTG 1260

GCGGATATGA AGGAGCTTGG TGACCAGTCT GTTCAACTTC ATAATCAGAT GATTTTGAGC 1320

CTTTCTCCAG ATGTGCTTGA TACCGTGATT TTCTATGGAG AAAATATTGC TGAATTAGCC 1380

CAATTGGCCA GTCAAATGTT CCCAATCGGC CACGTTTACT ACTTCAAGAA AACAGAAGAC 1440

CAGGATCAAT TTGAAGACCT AGTCAAGCAG GTCAAGGAAA GCCTTGGAGC CCATGACCAA 1500

ATCCTGCTCA AAGGCTCTAA CTCTATGAAT CTAGCCAAGT TGGTAGAAAG TTTAGAAAAT 1560

GAAGACAAGT GATTTTGTCA AGTATTTGCA AAGAATGATT GCCATTACAG ATACTGGCTT 1620

AACCTTTACA AAAGATCCGT TTGACCGTGA GCGCTACGAA GACTTGCGAA GTCTGTTATC 1680

TGAAATGTTG AATCAAGCAT CAGACCTTGA TTCCGAAGAA GTGGCAGAAG TCTTGAAGCC 1740

AACTTCTGCT TATGCGACTC CGTTAATGGA CGTCCGTGCT TGGATTGTTG AGGATGAGAA 1800

GATTTGTCTG GTTAGGGGAC AAGGAGAGGA TAGTTGGGCT TTGCCGG 1847


(2) INFORMATION FOR SEQ ID NO: 197: 









(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1062 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 197:

CAAGCGAAAA CATTTTTTAT TCCAAATAAA CAGAGCATTT TAGGAGAACA AGAGATTTTG 60

AATGCCAAGT CGATCTTGGC CTTGCTAGAC GGTTTGGAGT CACATAGCTA TGATGTAGTC 120

TATCTCCGTC AGCCTCTTAA TCGTCTCGAA TATATCGAGT GTGCGATAGT GGGGCAATCA 180

CAATTTCTCT TTAAGGTCAG TTATGCTGAT GGTCAAAAGG CTTACCGTGT CGATCTTCCT 240

GACCTACTAA CAAAGACAGA CTGGCAGATT ATCAAGTCAT TTTTAGATGC TTTGCTTGCT 300

TATACAGGGA CTGATATTGA AGGGCTAGAT GGTTTTGATT TTGAAGCTTA TTTCCAAGCA 360

AGTATTCAAG CCTATCTAGC AGACCCTGTA GCTCGTTTTA CGATTTGCCA AGGAATTTTT 420

AATCCTATTT TCTTTAGTCG TGAGAACTTG AAAAGCTTTT TAGAGGCAGA TGGCTTGGCT 480

CAGTTTGAAG CGCGTGTGCG TGCGGTTCAA GAGACAGATG CCTACTTTGC GAGAGTTTCC 540

TTCTATCAGG ATGGAGAAGG AAAAGTGCAT GGCGTTTACC ATCTAGCTCA AGGAGTCAAG 600

ACAGTTTTAC CGAGAGAACC GTTTGTTCCT GCAGCCTATA TTGAGCAATT GGTGGATAAG 660

GAAGTCCAGT GGGAGATTGA CTTGGTTCAA ATCACAGGAG ATGGCTCTAA ACCAGAAGAC 720

TATGAAGCCA TTGCTCGCTT GGACTATGCA AAATTCTTAG AGGTATTACC CCCATCTTTT 780

TACCACCAAC TAGACGCCAA TCAAATAGAA GTGCAACCCA TATTAGACAA AGATTTTAAA 840

ACATTAGCAC AAGAAAAGTA AAGCAGAAGC AGGTCAATCG ACTTGCTTTT TTGACATAGA 900

AAAAATCCTG CCAAGaTGAC AGGATTGCTA CTCAATGAAA ATCAAAGAGC AAACTAGGAA 960

GCTAGCCGCA GCTGTACTTG AGTACGGTAA GGCGAAGCTG ACGTGGTTTG AATTTGATTT 1020

TTGAAGAGTA TGAAGTTTAA AGAAAAGCCA AGATACGAAG AT 1062



(2) INFORMATION FOR SEQ ID NO: 198: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6846 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 198: 

TATCTACAAC CTCAAAAACA TGTTTTGawG gCTCGTCAGT cTATCTACAA CCTCAAAAAC 60 

ATGTTTTgAa kGCtcGTCAG tTCTATCTAC AACCTCAAAA ACATGTTTTG AcaGCcTcGT 120 

CAGTTCTATC TACAACCTCA AAAACATGTT TTGAGCTGAC TTCGTTAGTT TCATCTACAA 180 

CCTCAAAAAC ATGTTTTGAG CTGACTTCGT TAGTTTCATC TACAACCTCA AAAACATGTT 240 

TTGangnCnT CGTCAGTTCT ATCTGCAACC TCAAAGCAGT GCTTTgagcG CTTCGTCAGT 300

TCTATCTACA ACCTCAAAAC AGTGTGTTGC GCAGCCTTTA ATCAGCCGCC TAGTCCGCTC 360

TATGGTATTC ATTAAGTCAA CATCTCTTGT TTAAGAGCAC CAAATCAGGA AATCTTCTCG 420

ATTCCCTGAT TTTTTCTATT TACGTTTTCG TGTTGAGCTA CGTTCTGTCA AACCATGAGG 480

TAAGAGAACT TCACGTTCTT CCAACTCTTC CTTATGCATA ATCTTGGTCA ACATACGCAT 540

ACTAATGGCA CCAAGGTCAT AAAGAGGTTG GGCAATCGTT GTCAAGTTTG GACGGGTAAA 600

GCGTGAGATT TGTGAATCAT CACTAGTAAT AATTTCAAAA TCTTCTGGCA CAGAAACACC 660

CTTATCAGCC AAACCGTTCA AGACTCCTGC TGCCAACTCA TCACCTGTCA CAACTGCTGC 720

AGTTGCATTT GATGAAATCA AACGCTCTGC TAAGGCGTAA CCATCATCAT AGCTATATTT 780

AGATTCAAAT ACCAAACCCT CACTATAAGT GATTCCTGCT TTTTTCAAGG TTTCCTTGTA 840

GCCAACTAAA CGAACCTTAC CATTGATGTC ATCCACTAGC GGACCGCTAA CGAAAGCAAT 900

ACGCTCATTT TCTTTAGCAA GGTAACTCAC TGCATCAATT GTTGCTTGCT TATAGTCAAT 960

ATTGACACTT GGCAACTGGT GCTCAACATC GACAGTTCCT GCGAGAACAA TCGGAGTACG 1020

TGAACGCGAA AATTCTGAGC GAATTTTATC TGTCAAGTGA TACCCCATAT AGATAATGCC 1080

ATCTACCTGC TTTGAAAAGA GGGTATTGAC AACAGAAACT TCTTTCTCGT TATCTTCATC 1140

GCTATTAGCT AGGACAATAT TGTACTTGTA CATTTCTGCA ATATCATCAA TCCCCTTAGC 1200

CAAACTCGAA AAATAACCAT TGGTAATATT TGGAATCACG ACACCGACAG TGGTTGTCTT 1260

TTTACTTGCA AGACCACGCG CAACTGCATT TGGACGATAA TCCAAACGAT CAATTACCTC 1320

TAGCACTTTT TTACGGGTAT TCTCTTTTAC ATTTTTATTG CCATTGACCA CACGGCTGAC 1380

CGTCGCCATG GAAACACCTG CTTCACGAGC GACATCATAA ATGGTTACTG TATCATCTGC 1440

ATTCATTCCT TTTCCTGTCC TTTCTATCTC ACACATTCTT TTACAAGTAG AGGTACTGAT 1500

TGAAGCTCTA TATCTACTTA CAAAAGTGAA GATGTGAAAA TTTCGTTTTC ATATTTCTAC 1560

TTATTCCATT CTATCACTAA TTGTAAACAC TTTCAAGTGT TTTTTGAAGA TTGATTGAAA 1620

AAATTTCATA GAAAACCTAG GTTTAGCTCC TTGCTACCAC CTTAGACTAA ACAAAAAGGA 1680

GGAAACTAAG CCCTCCTAAA GTTATAGTAA AATGAAATAA GAACAGGATA AATCGATCAG 1740

GACAGTCAAA TCGATTTCTA ACAATGTTTT AGAAGTAGAG GTGTACTATT CTAGTTTCAA 1800

TCTACTATAG GTATTGTTCC ATTCACTACC GTCAATTTTA GCACATAGTC TTCATGAAAA 1860

TATTATATCA TCATAACCAA CCAGATTCTT TCGCGATATT AGCTGCCTCT GTTCGATTAC 1920

CTGCATCTAG TTTCGAAAGA ATATTGGTGA CATAGTTTCG GACTGTTCCG TTGGATAGAT 1980

AAAGTTTGTC TGCAATTTCT TGGTTAGAGA AGCCCTGAGC AATTCCCTTT AAAACTGCGA 2040


TTTCTTGCTC 


CGTTAATGGA 


TTGGGATGCA 


TCATCACCAC 


TTCCATCAAT 


TCAGGCGAAT 


2100 
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ACTCCTTGCG 


TCCTTCGAGG 


ACGGTGTGCA 


AGGTTTGCAT 


GAGGTCTGCA 


ATGTTTCTTT 


2160 


CTTTTAATAC 


ATAAGCATCT 


ACTCCAGCCT 


TGACCGCACG 


TTCAAAATAC 


CCAGGACGCT 


2220 


TGAAGGTCGT 


CACCACAACC 


ACCTTTGTTT 


CAAGCTTTTC 


TGCTCGTATC 


CACTCCAAGA 


2280 


CTTCAAGACC 


TGTCTTAACA 


GGCATTTCTA 


CGTCAAGGAT 


GGCGATATCT 


ACAGACTCCT 


2340 


TTTCTAATAG 


TTGGATTGCT 


TCTTGCCCAT 


TCTTGGCTTG 


AAAGACAGAC 


TCTACATCCG 


2400 


GTTGAAGCAT 


GAGCAACTGG 


CACATGGCAT 


CTCGCAACAT 


ACTTTGATCT 


TCTGCGACTA 


2460 


ATACTTTCAT 


CTACTTTCTC 


TCCTTATAAA 


GTAGTCGAAC 


CTGCACTTCA 


GTTGGATGTT 


2520 


TCTGACTGAT 


TACACTTACT 


TCTCCTGAAA 


ATGGAAAAAC 


ACGATTTCGG 


ACTGTATGGA 


2580 


GCTCATCCCC 


GCTTATAGAG 


GCAAAGCCAC 


AGCCATCATC 


TCTCACTGTT 


AGAATGAGTT 


2640 


CTTTCTCTGT 


CCGTTCTAAT 


TTCAAGTAGA 


CTTTAGACGC 


TTTAGCATGT 


TTGATGATAT 


2700 


TGGTCACTAA 


TTCAAGCAAA 


ATCATGGAAG 


CCGTTGACTC 


CAATTCCTGA 


GTTAAGCTAG 


2760 


ACTTGTCCAA 


GTGATTCTCA 


ACTTGAACCT 


CAATTCCAGC 


AATTTCTAAC 


ATCTTTTTCA 


2820 


CAGTCTCTAG 


TTCGGATGTC 


AAAGTTCTAG 


ACTTAAGATT 


TTCCACAATG 


GTTCGCACTT 


2880 


CATTCATGGA 


tCCTTGCTGA 


TCTGGTGAAT 


TTCTTTTAAT 


TCCTTTTCCA 


CCTGTGGATA 


2940 


AGCCTCCATC 


TGAAATAACT 


GCAAGGCTAA 


ATCTGTCTTG 


ACACTCAGCA 


TAGCAAAGGT 


3000 


ATGTCCCAGA 


CTATCATGCA 


AATCCTGACC 


GATACGACTA 


CGTTCATTTT 


CAGCAAGCAA 


3060 


TAGATTTATC 


TGAGCATTTT 


GCTTGACCTG 


AGCTTCTTTC 


AAATCCTCGA 


CAATACGAAT 


3120 


CCGAACCAAT 


CCAAAAGTCA 


TTAAATCGAC 


AAAAGTAAGA 


ATTACAAGTA 


GATAGAATAG 


3180 


AAACTCAACT 


TCGATTCTCT 


GAAAAATCAA 


CAGTTGCCCC 


ACAACAAGGA 


CTTGAGCAAG 


3240 


AAGAAAAGTC 


CAGACATGTA 


AAGACTTTAA 


ACTACGTACG 


CTGAAATGAT 


AACTTAAGAG 


3300 


ATTGGATAGG AAAAAGAAAA ACCAGATATA ATTAACAGCA ACAAAGGCAG TATTCCCAAC 


3360 


TACATAAGTC 


AGCATGAGGC 


CCCAATATAG 


CCAAGATAGG 


CGCTGGCTCT 


TAGTTGTTAA 


3420 


AACACCCAAA 


TATGCCACTA 


CAAATAGAAT 


ATCAATCAAT 


AAATGCCAGG 


CAGAAAGCCA 


3480 


CCCAGTCACT 


ACAGACAGGA 


TGGGGAAAAT 


CATAAAAATT 


AAACTGATCC 


AAAACATATA 


3540 


ATGTATTCTT 


TTCAGTCTTT 


CAAGCATTAA 


GCATTCTCCT 


TATGACCTTG 


AAGGTAAATG 


3600 


GTCAAACCAA 


ACAAAACTAC 


TGAAAAAACA 


AGTAAATAAA 


CTGTGGCTGA 


TAGATTGATG 


3660 


CCACCCTCAT 


TTAAGAAGGT 


CTTGAGCAAC 


TCCATCAACT 


GATAGGTCGG 


GAGACACTTA 


3720 


CCTACTACTT 


GCATCCAGTC 


TGGAAATAAA 


GAGATAGGCA 


TCCAGAGTCC 


ACCTAAAACA 


3780 


GCCAACCCTA 


GATAAAGAAG 


ATTGCCCACG 


ACAGACATCA 


ACTGACTAGT 


TGGTAAGAGA 


3840 



GTCAAGGTCA AACCAAGCGC TACGAAGGCA ATACTTCCTA CTATCAGCAA AAGTGCAGCC 3900
CCAATCCAAT TTCCAAGAGA CATGTCCACA CCTCTTACAA AATGCCCAAC TGAGAAAACC 3960 
ACCAAGATTG AAACCAAATA ATCAACCAGC ATACTTGTTA TCTTTGATAG ATAATATTCT 4020 
ACCATATTTA CAGGGCTATG ACGCAATGTT TTCTGCCAGT TGTTGATCTT GTCGGTATGT 4080 
AAAACAACTG GGAATGAGAA GATAGCTGTT GACATCATGG AAAATGCAGT CATGGAGATA 4140 
AGATAATCAC GCATAAAATT CGCGAGTTCA CCTGGTGTGT CCTGATAGAT ACCAGAAAAA 4200 

AATAAATAGA AAGCCGTCGG CATCCCTACT GACAATAGAT AATAGATCAA TTGTCGTTTG 4260 

GTCAATAAAA ATTCTATCTT ACTAAGTGCT AGCCATCGTT TCATCTTAGT TATCTCCCTT 4320 

CTGCGTTTCT TCAAAGATTG TATCCAACAA ACTACGATTA TTAACTTCAA TTTCTTGTAT 4380 

GCCACATCCT GCTTGAACTA ACAGTTCCCA AAAAGCATCT GCTTCGCGTG TGACTACTTG 4440 

TAGAGCATCC TGTTTTTGTG ACCAGTTTTC AACCAAGTTA GACTGCTCAA TGACTTCCTT 4500 

GTATGCCAGA GGAAGGATAA AATGCTTTTC AATTCCCTCA CTACGCATAG CTAGAGGCGT 4560 

CGTATCACGA ATCAACTCTC CCTTATTTAA AACCAAAATC CGGTCAGCCG TATGCTCTAC 4620 

CTCTTCAATA TAATGAGACG AATAGAGAAT CGTGACTCCT TGCGCTTTTA GGTCCCGAAC 4680 

GATTTCCCAA AAGCGTTGAC GAGTTGAAGT ATCCATGGCA GCAGTTGGTT CATCTAAAAA 4740 

GACAAGCTTT GGTCGCCCAA TCAAGGTCAA GACAAAAGAG AAGAGACGCT TTTGCCCGCC 4800 

TGACAATTTT TCTGCGAATT GCTCTTTTTG TTGCTGGTCA AACTGCAATA GTTGATCGAT 4860 

TTCCTGATCG CTCAAGGAAT TTGGATAGAT ACGTTGAAAG AAAGCAATCA ACTCTTTGAC 4920 

CTTTAATTTC TGAACGATGA CATTTTCTTG AGGCAGATAA CCTCTAATAT AGTCTAACTG 4980 

AGAACTCGTC ACTGACAAGC CTTGGATGGA TACTTGACCG CTTGTGACCA GTTTATCTCC 5040 

AAGCAGACAG TCCAAGAGTG TGGTCTTCCC AGCACCATTG GGCCCAATCA AGGCGACGCA 5100 

TTCACCTTCA GCTACCTCAA AGGAAATACC CTTCAAAATA GCCTTGCCCT TGATGTTTTT 5160 

ATTTAGGCTT TCTACCTTAA TCATATTCAT GATATTCTCC TTTCAACCAC TCCATTCTCA 5220 

TAAGGAAAAC GACGAAAATC ATAAATCCAA ACCCCAAAGC ACCACGAATG AATTGGCGAA 5280 

gCAAGGTTTG GTCAAACCAA CCTGTAAACA TTTCCACTAA CCATACCAAG AGTGACAGGC 5340 

CGATAAAGAA ATAGATGATC CCTCTCTTCA TTCCTCAAGC TCCTTTTTCA CATCTCCGAC 5400 

TAATTTCAAA CCTTCTCTAA CAAGCCAAGA CATCATTCCA AAGCCAGCAA AGAGCTCCCA 5460 

AGGAAAATGA TAGAAACTCT CATCCAATCC CGAAAACATG AGTTAGGTCA TAACTCCTGC 5520 

TACTACTAAA CTCACTGCGA TAATCATTTT ATTTCTCATC TCTTCTTCCT CCATTTCATA 5580 

CTACAATTAT AGTCTTTTGA AATCAGAGGA GACAGAAGCT TCTGTCACTA GAAAATATGA 5640

CAAATGTCAT AAAAAATTCT GTTCAAAACA AGCAAGATAC ACTATACAAT AAAACACAAT 5700

TAGAAAAATC TAAGGCAACT TCCTCAAAAG AGATATCAAA CCCAATTCAC ACCATAATGT 5760 

AAACTAATAC TTATTTAAAA TCAAAAAGAG TAGAAATTTT TATCAGACAA ACACATATAT 5820 

AGTGTATTGA ATCTATAACA GTAGGCCTTA AATACTAAAA TATTTCTATA AATTAATTTA 5880

ACTTTCCTGA TAGAGCTGTT CATATCTTAT TTCAATTCTC TAAATTATAC GTTGAACAAA 5940 

ACCCTTCTAT TTCTTTCTTA AAGATTTATA AGAGTTATAA AATCTGTTAA ATTTCAATGT 6000 

GTATACCTAA ACTACGGTAT TTATTGAAAA GACTGGAGAC AAAAAGTATA CGCTGCCAAA 6060 

ATGAATTACT GAAAATCAAA AAAGAGAGAA CCAAACTGAT TCCCTCTTAA TGTATATAAT 6120 

ATCTAGTTTT AAAAATACAC ACTCACATAT CTCTGTAATG AATCGGGAAG ACAGGATTCG 6180 

AACCTGCGAC ACCTTGGTCC CAAACCAAGC ACTCTACCAA GCTGAGCTAC TTCCCGAGTT 6240 

AAATAGAAAA ATGCACCCTA GAGGAGTCGA ACCTCTAACC GCCTGATTCG TAGTCAGGTA 6300 

CTCTATCCAG TTGAGCTAAG GGTGCTCCAT ATTATGCCGA GGACCGGAAT CGAACCGGTA 6360 

CGATCGTTAC CAATCGCAGG ATTTTAAGTC CTGTGCGTCT GCCAGTTCCG CCACCCCGGC 6420 

CTCTCTAAGC GAACGACGGG ATTCGAACCC GCGACCCCCA CCTTGGCAAG GTGGTGTTCT 6480 

ACCACTGAAC TACGTTCGCA CTGTTTTCTT CTATCTAAAA ATGCCGGCTA CATGACTTGA 6540 

ACACGCGACC CTCTGATTAC AAATCAGATG CTCTACCAAC TGAGCTAAGC CGGCTCATTT 6600 

GTTATATCTT AATGCGGGTT AAGGGACTTG AACCCCCACG CCGTTAAGCG CCAGATCCTA 6660 

AATCTGGTGC GTCTGCCAAT TCCGCCAAAC CCGCATATAT GACCCGTACT GGGCTCGAAC 6720 

CAGTGACCCA TTGATTAAAA GTCAATTGCT CTACCAACTG AGCTAACGAG TCTAAAATAA 6780 

CTTGCGTTAC CTTAAACGGT CCCGACGGGA ATCGAACCCG CGATCTcGCC GTGACAAGGC 6840 

GACGTG 6846 
(2) INFORMATION FOR SEQ ID NO: 199: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 2911 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 199: 

GAATTCATTT TAAATAAAGA TACGGGAGAG GTAAGTGAAT TAAAACCTCA TAGGGTAACT 60 

GTGACCATTC AAAATGGAAA AGAAATGAGT TCAACGATAG TGTCGGAAGA AGATTTTATT 120

TTACCTGTTT ATAAGGGTGA ATTAGAAAAA GGATACCAAT TTGATGGTTG GGAAATTTCT 180

GGTTTCGAAG GTAAAAAAGA CGCTGGCTAT GTTATTAATC TATCAAAAGA TACCTTTATA 240 

AAACCTGTAT TCAAGAAAAT AGAGGAGAAA AAGGAGGAAG AAAATAAACC TACTTTTGAT 300 

GTATCGAAAA AGAAAGATAA CCCACAAGTA AACCATAGTC AATTAAATGA AAGTCACAGA 360 

AAAGAGGATT TACAAAGAGA AGAGCATTCA CAAAAATCTG ATTCAACTAA GGATGTTACA 420 

GCTACAGTTC TTGATAAAAA CAATATCAGT AGTAAATCAA CTACTAACAA TCCTAATAAG 480 

TTGCCAAAAA CTGGAACAGC AAGCGGAGCC CAGACACTAT TAGCTGCCGG AATAATGTTT 540 

ATAGTAGGAA TTTTTCTTGG ATTGAAGAAA AAAAATCAAG ATTAAGATAA AAGCTATAGA 600 

AAAAAATGGT TTATGTACTG AGATTAGATA GTGAGGTGAT GACATAGTTT TGTGAAAATA 660 

GCCATTTATA ACTCAATTAT TTAGTTTACT TTACTTTACT AGTGATACTA TTTGGAGTTA 720 

TTAATGGACT TAGTTTATAT AACTAATGAA TTGATTGAAA GGGTTAGTAT TGACAATATT 780 

GGTCATATTG ACTAGAAAAT AGAGTCTATC AAAATTTAAA GGCTAATAGA GGTGATGAGA 840 

CAATTTCGGC TCTTTGTCAA CTGTAGTGGG TTGAAGTCAG CTAAGCTCGA GAAAGGACAA 900 

ATTTTGTCCT TTCTTTTTTG ATATTCAGAG CGATAAAAAT CCGTTTTTTG AAGTTTTCAA 960 

AGTTTCGAAA ACCAAAGGCA TTGCGCTTGA TAAGTTTGAT GAGATTATTG GTCGCTTCCA 1020 

GTTTGGCATT AGAATAGTGT AGTTGAAGGG CATTGACAAT CTTCTCTTTA TCTTTGAGGA 1080 

AGGTTTTAGA GGATGAACTT GATTCAGATT GTCCTCAATG AGTCCGAAAA ATTTGTCAGG 1140 

CTCCTTATTC TGAAAGTGAA AAAGCAAGAG TTGATAGAGA TTATAGTGGT GTTTCAAGTC 1200 

TTCTGAATAG CTCAAAAGTT TATCTATAGT AGATTGAAAC TAGAATAGTA CACCTCTGCT 1260 

TCTAAAACAT TGTTAGAAAT CGATTTGACT GTCCTGAATG ATTTGTCCTG TTATTATTTC 1320 

ATTTTACTAT AAATCCACGT TTACGAATCT CTTTCCACAC TTGTTCAATG GGGTTCATCT 1380 

CTGGTGTGTA TGGAGGAATA AATGCAAAAC CAATATTAGT CGGAATCTTT AAGGTACTTG 1440 

ATTTATGCCA TATAGCATTG TCCATAACGA GTAAAAGATA ATCATCTGGA TAAGCTTGTG 1500 

AAAGCTCCTA TTCCTAAAGC CCCTTTATAA CCTCTTGCGA GAGAGACTAT TGACTCAGCC 1560 

CTTACTTCAT GCGGATGAAA CTTCTTATCG GGTTCTAGAG AGTCATAGCC ATCTGACCTA 1620 

CTATTGGACC TTTTTGTCTG GGAAAGTTGA GAATCAAGCA ATCACGCTGT ACCATCATGA 1680 

TCAGAGTCGG AGTGGTTCGG TAGTACAAGA ATTCCTAGGA GATTATTCTG GCTATGTTCA 1740 

TTGTGATATG TTGCGGCAGT AACTTAGGAC TTTAGTCCTC TAGTTCTGCC TATGCGATAG 1800 

CAGTCCAAGG TTTAGGAGCA AGGCGACGCT AAGCTTGGTA AACTGCGAAC CGCTAGAAGC 1860 

TTATCGTCAA CTGGAAGAAG CTGAACTTGT TGGATGTTGG GCGCATGTGA GAAGGAAATT 1920

TTTTGAAGCG ACCCCCAAGC AAGCAGATAA ATCATCCTTA GGAGCTAAAG GTTTAGCTTA 1980

TTGTGATCAG TTATTTTCCT TGGAAAkAGA CTGGGAGGCT TTGCCAGCTG ATGAACGACT 2040 

ACAGAAACGT CAAGAACATC TCCAGCCCCT AATGGAAGAC TTCTTTGCTT GGTGCCGCCG 2100 

TCAGTCAGTT TTAGCAGGTT CAAAACTAGG AAGGGCAATT GAATACAGCC TCAAGTATGA 2160 

AGAAACCTTT AAGACTATTT TGAAAGACGG ACATCTGGTC CTTTCCAATA ATCTAGCTGA 2220 

ACGCGCCATT AAATCATTGG TTATGGGACG GAGTAAAAGA GTCCAGTGGA CTCTTTTAGC 2280 

CTGAGCTCAG TTTAAAAAAG CGAGGGTGGT TATTTTCTCA AAGTTTTGAA GGAGCTAAAG 2340 

CAAGAGCTAT TGTTATGAGC TTGTTGGAAA CAGCTAAACG TCATCAATTA TAGTGCGTTG 2400 

AATCTATAAC AGTACGCATC GACTGCTAAA ACATTTCTAT AAATCAATTT TCCTTTCCTA 2460 

ATCGATTTGT TCATATCTTA TTTCAATCCA TTATAAATAG CGAGAAATAT CTATCCTATC 2520 

TTCTAGAATG TCTTCCAAAC GAGGAAACTC TCGTAAACAA AGAGGTTTTA GAGGTTTATT 2580 

TACCATGGAC TAAAGTTGTA CAAGAAAAGT GCAAATAAGA AATCTCCAGA TTAGGAACTA 2640

TCCGTGAGTT CACTAATCTG GAGATTTTTC AATAGAtTCG TTATTGGGCG GTTACGATAT 2700 

GATcACTACT TCGTCAGTCT TATCTACAAC CTCAAAACAG TGTTTTGAGC AACCTGCGAC 2760 

TAGCTTCCTA GTTTACTCTT TGATTTTCAT TGAATATTAG AACAGAAAAA ATGCTTGGAG 2820 

TATTTGTTTG TGTGTTTATT TTTATATAAC AAACTATAAA CAAAATAAAA ATATAAAAAA 2880 

AGAGACAAAA AAGAACAGAA AGTAATTGAC A 2911 
(2) INFORMATION FOR SEQ ID NO: 200: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6854 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 200: 

GAAAATAAGT CTTGACAGAA AGCGCTATCA ATGATAGAAT GAATTCAGAT AAAAAGATTT 60 

ATTTTTAAAA CAAAAATGAA ACGTTTCAAA AAAAGAAATA AAGAGACAGC GCCAAGCGCT 120 

ATCTTTTCTA GAAAAAAATG AAACGTTTCA AAAAAGGAGG TTGCTATGAA TAGCAAAGCG 180 

AAGCAAGTTT CTCTTTGGGA AAGAATCAAG AAACAAAAAC TCTTGTTATT GATGACTGTC 240 

CCCGGTTTAG TTTTAACCTT TATCTTTAAA TACATCCCTA TGTATGGGGT TTTAATCGCA 300 

TTTAAAGATT ACAATCCTTT AAAAGGAATT TTAGGGAGTG ATTGGATTGG TTTTTCTGAG 360

TTTACAAAAT TCATATCCTC TCCCAACTTT GGTATCTTGT TAGCCAACAC ATTAAAATTA 420

AGTATCTATG GTTTATTGCT TGGCTTTTTA CCACCAATCA TTCTCGCGAT TATGCTCAAT 480

CAACTCTTGA GTGAAAAAGT CAAAAAACGA ATTCAGCTCA TTTTATACGC ACCAAACTTT 540

ATCTCAGTCG TTGTTATTGT CGGTATGATT TTCCTCTTCT TTTCAGTGGG AGGACCAATC 600

AACAATTTTC TTTCTATGTT TGGAATGAAG GCTGACTTCT TGACAAATCC AGACTTCTTT 660

AGACCTTTAT ACATCTTTAG TGGTATCTGG CAAGGAATGG GCTGGGCTTC AACGCTCTAC 720

ACGGCAACAT TGGTAAATGT AGATCCAGCC TTAGTAGAAG CAGCCCGACT GGATGGAGCC 780

AATATCTTCC AACGAATCTG GCACATTGAT ATTCCAGCTC TTAAGCCTAT TATGGTTATC 840

CAATTTGTTT TAGCTGCAGG TGGAATTATG AATGTCGGAT ATGAAAAAGC ATTCTTGATG 900

CAGACATCGT TAAATTTGCC AACTTCTGAA ATTATCTCGA CATATGTCTA TAAAGTTGGT 960

CTTGTATCAG GAGACTATTC TTACTCAACA GCGGTTGGTT TGTTTAATGC AGTGATTAAC 1020

GTAGTATTGC TTGTTGCAGT TAACCAAATC GTTAAACGCA TGAATAATGG TGAAGGAATT 1080

TAAGGAGGAA AGTATGAAAA ATTCGATTAT GGATACAAAA TTTGATAGAC GTATCTTACT 1140

CTTAAATAAA ATCATTATTG TCTTTATCGT TTTGATGACT TTGCTTCCTT TACTTTATAT 1200

CGTCGTAGCA TCCTTTATGG ATCCTAAGGT TCTGGTTAGT AGAGGGATTA GCTTTAATCC 1260

AGCCGATTGG ACTGTAGAAG GTTACCAGCG TGTATTCAGT GACCAATCTA TTCTAAGAGG 1320

TTTTATCAAT TCTCTACTAT ACTCTTTTGG ATTTGCAGCT TTAACAGTCT TGCTATCTGT 1380

GTTTACAGCT TATCCTCTTT CTAAGAAAGA CTTGGTTGGA CGTCGTTGGA TTAACTACTT 1440

CTTGATTGTA ACTATGTTCT TTGGTGGTGG TTTAGTCCCA ACTTACTTGC TCGTAAAAGA 1500

ATTGGGAATG CTCAATACTC CATGGGCTAT CATTGTTCCA GGTGCTGTTA ACGTTTGGAA 1560

TATTATTCTT GCTAGGGCCT ATTTCCAAGG ATTGCCTGAA GAATTAGTTG AAGCTGCTGT 1620

CATTGATGGT GCAAATGATT TACAGATTTT CTTCAAAATC ATGCTTCCTC TTGCAAAACC 1680

AATTATGTTT GTTCTCTTCC TTTATGCTTT TGTAGGACAG TGGAACTCAT ACTTTGATGC 1740

AATGATTTAT ATCAAGGATC CAAACTTGGA ACCATTGCAA CTTGTACTTC GTAAAATTCT 1800

CATTCAGAGC CAACCAGGTC AAGACATGAT TGGAGCACAA GCGGCTATGA ATGAAATGAA 1860

ACGTTTAGCT GAATTGATTA AATACGCAAC TATTGTCATT TCCAGCTTGC CATTGATTGT 1920

TATGTATCCA TTCTTCCAAA AATACTTTGA TAAAGGAATT ATGGCTGGTT CACTTAAAGG 1980

ATAAAAAAAG AAAAAATAAA AGGAGTTTTC TCATGAAATT CAAAACATTC TCAAAATCAG 2040

CAGTTTTGTT GACAGCTAGT TTAGCAGTAC TTGCAGCCTG TGGCTCAAAA AATACAGCTT 2100

CAAGTCCAGA TTATAAGTTG GAAGGTGTAA CATTCCCGCT TCAAGAAAAG AAAACATTGA 2160

AGTTTATGAC AGCCAGTTCA CCGTTATCTC CTAAAGACCC AAATGAAAAG TTAATTTTGC 2220
AACGTTTGGA GAAGGAAACT GGCGTTCATA TTGACTGGAC CAACTACCAA TCCGACTTTG 2280 
CAGAAAAACG TAACTTGGAT ATTTCTAGTG GTGATTTACC AGATGCTATC CACAACGACG 2340 
GAGCTTCAGA TGTGGACTTG ATGAACTGGG CTAAAAAAGG TGTTATTATT CCAGTTGAAG 2400 
ATTTGATTGA TAAATACATG CCAAATCTTA AGAAAATTTT GGATGAGAAA CCAGAGTACA 2460 
AGGCCTTGAT GACAGCACCT GATGGGCACA TTTACTCATT TCCATGGATT GAAGAGCTTG 2520 
GAGATGGTAA AGAGTCTATT CACAGTGTCA ACGATATGGC TTGGATTAAC AAAGATTGGC 2580 
TTAAGAAACT TGGTCTTGAA ATGCCAAAAA CTACTGATGA TTTGATTAAA GTCCTAGAAG 2640 
CTTTCAAAAA CGGGGATCCA AATGGAAATG GAGAGGCTGA TGAAATTCCA TTTTCATTTA 2700 
TTAGTGGTAA CGGAAACGAA GATTTTAAAT TCCTATTTGC TGCATTTGGT ATAGGGGATA 2760 
ACGATGATCA TTTAGTAGTA GGAAATGATG GCAAAGTTGA CTTCACAGCA GATAACGATA 2820 
ACTATAAAGA AGGTGTCAAA TTTATCCGTC AATTGCAAGA AAAAGGCCTG ATTGATAAAG 2880

AAGCTTTCGA ACATGATTGG AATAGTTACA TTGCTAAAGG TCATGATCAG AAATTTGGTG 2940 

TTTACTTTAC ATGGGATAAG AATAATGTTA CTGGAAGTAA CGAAAGTTAT GATGTTTTAC 3000 

CAGTACTTGC TGGACCAAGT GGTCAAAAAC ACGTAGCTCG TACAAACGGT ATGGGATTTG 3060 

CACGTGACAA GATGGTTATT ACCAGTGTAA ACAAAAACCT AGAATTGACA GCTAAATGGA 3120 

TTGATGCACA ATACGCTCCA CTCCAATCTG TGCAAAATAA CTGGGGAACT TACGGAGATG 3180 

ACAAACAACA AAACATCTTT GAATTGGATC AAGCGTCAAA TAGTCTAAAA CACTTACCAC 3240 

TAAACGGAAC TGCACCAGCA GAACTTCGTC AAAAGACTGA AGTAGGAGGA CCACTAGCTA 3300 

TCCTAGATTC ATACTATGGT AAAGTAACAA CCATGCCTGA TGATGCCAAA TGGCGTTTGG 3360 

ATCTTATCAA AGAATATTAT GTTCCTTACA TGAGCAATGT CAATAACTAT CCAAGAGTCT 3420 

TTATGACACA GGAAGATTTG GACAAGATTG CCCATATCGA AGCAGATATG AATGACTATA 3480

TCTACCGTAA ACGTGCTGAA TGGATTGTAA ATGGCAATAT TGATACTGAG TGGGATGATT 3540 

ACAAGAAAGA ACTTGAAAAA TACGGACTTT CTGATTACCT CGCTATTAAA CAAAAATACT 3600 

ACGACCAATA CCAAGCAAAC AAAAACTAGA GGTTGATTAT GGGAGATAAG AAATACACAG 3660 

TAGAAAAAGC CAATCGTTTT ATAGCAGAAA ATAAACATCT CGTTAATACT CAATATAAGC 3720 

CTGAAGAACA TTTTTCAGCT GAGATTGGTT GGATCAATGA TCCAAATGGA TTTGTCTATT 3780 

TTCGTGGAGA ATACCATCTC TTTTATCAAT TCTATCCATA TGATAGTGTT TGGGGGCCTA 3840 

TGCACTGGGG ACATGCTAAA AGTAAGGACT TGGTGACTTG GGAGCACTTG CCAGTGGCAC 3900

TTGCTCCTGA CCAAGATTAT GACCGAAATG GTTGTTTCTC AGGCTCTGCC ATTGTCAAGG 3960

ATGATCGCCT CTGGCTCATG TACACTGGAC ATATCGAAGA AGAAACCGGT GTCCGCCAAG 4020 

TGCAAAATAT GGTATTTTCA GATGACGGGA TTCACTTTGA AAAGATTTCC CAAAATCCAG 4080 

TTGCAACTGG ATCAGACTTA CCAGATGAGT TGATTGCTGC TGATTTCCGT GATCCAAAAC 4140 

TCTTTGAAAA AGATGGACGC TATTACTCCG TAGTAGCTGC CAAACACAAG GATAATGTGG 4200 

GCTGTATCGT TCTACTAGGG TCCGATAACC TAGTAGAATG GCAGTTCGAA TCCATCTTTT 4260 

TAAAAGGGGG AGAACACCAA GGTTTTATGT GGGAATGCCC AGATTACTTC GAGTTAGATG 4320

GGAAAGATTG CCTTATTATG TCACCCATGC GTTATCAGCG TGAGGGAGAC TCATATCATA 4380 

ACATCAACTC ATCGCTTTTG TTCACGGGTA AGGTAGATTG GAGAGAAAAA CGTTTTATCC 4440 

CAGAATCAGT TCAAGAAATT GATCATGGCC AAGACTTCTA TGCGCCTCAA ACATTGTTGG 4500 

ACGATCAAAA TCGTCGTATC CTGATTGCTT GGATGCAGAC ATGGGGGCGT ACCCTTCCAA 4560 

CCCATGACCA AGAACACAAG TGGGCATGTG CCATGACTCT ACCTAGAATT CTAAGATTGG 4620 

AAGATGGCAA ACTAAGACAA TTCCCTGTTA AAAAAGGCCA ATATCAAATC CAAATAGATA 4680 

AAGATTGTCA TTACCACTTA GGAAATGATA TAGATTATCT TGAATTTGGT TATGACAGTA 4740 

ATGCGCAGCA AGTTTACATT GATCGTAGCC ATCTTATTCA AAAAATTCTA GGTGAAGAAG 4800 

AACAGGACAC TAGTCGACGG TATGTAGATA TTGAAGCTAA AGAATTGGAA GTTGTTCTAG 4860 

ATAAAAATTC CATCGAGATT TTTGTCAATC AAGGTGAAGC AAGCTTGACT GCAACTTATT 4920 

ACTTAACGGT GCCAGCTGAG CTATCACGAA TTGATTAAAA ATTAAGTTAT TTCTCCTAAA 4980 

GAAAAAGTTC TCTTTCTAAA ATAGTGGAAA GAGGACTTTT TGTGTTTTGG GTATATAAGC 5040 

TTAGTTTATG GTATTTGTAA AATTGGTGTT GGATTATGAT TTAAGCTAGT TTTCTAAAGA 5100 

ATTTGAAAAA AATTTTATTT AAGCAAAAAA ACCTTGGTTC CAAGGCTTTT CCTGTTGTAT 5160 

TTAGATGCCC CCTACAGGGA TTGTAGGAGA TATGTTGCTT AGATGTTCTT GATTTTCTGG 5220 

TGTTTTGTAA CGTTTAAATG AGTTTTTTGA GTTTGTTGGT GGGGCGTTGC CCGGCAATTG 5280 

CCCGACTTAT TGCTTGAAAA AGAATTTAAA ATATAGTATA GTTAATTATA GATTAACACT 5340 

TGCTTGGAGG AACTGATGAA GAACAATGAA AGATTAGGTA TTAAATTAAG TAGAGATAGC 5400 

GTTTTAGGAT TGAGGGAAGT TAGAAGGCTT TATTTAGGCA GTTCAGATAT CCCAGTTTCT 5460 

GATGGCTATG TGATTGAAGT TGCTTATAAC CAGATATCAC ATGAGATTGA TATTATTGAT 5520 

TGGGTAGAGT TGAACAAGTC AAAAATTAAG ATAAGTGAAA TTAGTGAAAG CGTGGATATA 5580 

GATGCCACTA GCTTGAGAAC AACTTTGACT TTAGACACAT TAGTATATGA AGGTATGAGA 5640 

GATATACAGT TAAAGTTGAG AGAGCTTACA AAGGGGAGAG TATTCTTTTC ATTTGTAGTG 5700

AAGTTAGTTT TGTTTGCTTC TATTTTAAAG AAAAAAGATT TACTAGAAAA ATTTCAAGAA 5760

AAGTGTTAAT CAAGTATTGA CACTTTATCT GGATTTCGGT ATAATATGCT TAGAAAGGAA 5820 

TCTTTCTAAA TTTTTTTCGT CCTTATGTGT TAATCAAAGA CGAATACAAA AACATATTTT 5880 

TTTACTCTAA AAAGTGTTAA TCAATGATGT ATTTGTTAGA GAGGTAGATA AATGGAATTG 5940 

AGAGCACCAC CAGTTATAAT AGTATAAAAC GTATAATAAA AATATTTTAA CTTGAATTAT 6000 

AGAAAAGGAG AAACAAATCA TGAAACAAAA ACAACCGATT GTTTCTAGAA CGAAACAACA 6060 

TACATTTGAA GAGCTTATTC AAGACCAAAA GTTAGAAAGA TTGGCTAAGT TGTCGCCCGA 6120 

TTTGGTTGGA AGGTATGGTT TTACTGCTAG CTGTGCGTCT TCATTTGCGA ACTTGATTAA 6180 

AGAAGCGTAT GGGGGTAAAA ATCTAAACGT AGTTTATGCG AGTCGGATGT TGGCTCTCTG 6240 

GAATATTGCT TGCAGTTGTT ATCATAAGGC TGATGGGTAT TCTTTAGCAG ATGCGCTTTT 6300 

TAGTGATAAA AAAATTTGTC TAGATTCTTA CTATTACCAC AAGAATACCT CTAATACCAT 6360 

AACTAGTGAT GTGATAAAAG ATGTTTACGA TAATTATAAT AATTATATGG TTTTAACTCG 6420 

AGAAGCGACA CCTGAATACA TTTATGTTGT ACAAACTGAA ATGCCAAAAG ATTCAGATTT 6480 

ATATTTTTAT ATTAGAGAAG TTCTGGGATT ATCGTTTAGT ACCATGCATT ATGCATTTTT 6540 

AGTCAAGGTT CTTGCAGGAG CGCTTGCTAG AAAATATAAG CCATATCGAA ATTGAATTAT 6600 

TTAAATTTAT ACTCTTCGAA AATCAAATTC AAACCAAGTC AGCTTCGCCT TGCTGTACTC 6660 

AAGTGCTGTC TGTGGCTAGC TTCTTAGTTT GCTTTTTGAT TTTCATTGAG TATTACTCTT 6720

ATGGTAGTTA TTTATGGCAT AATAATATTG ATTTGGGAGT TATAGCGAAA ATTTTAGGTT 6780 

CTATAATATT TGTAGTGGGT AAACCACTAT AGATATTATG GAGCCTATTT ATTGTAGAAA 6840 

AAAGTCCCAT ATGA 6854 
(2) INFORMATION FOR SEQ ID NO: 201: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3895 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 201:

TCCTTGCTAA GTTTATACTC AATGAAAATC AAAGAACAAA CTAGGAAGCT AGCCACAGGT 60 

TGCTCAAAGC ACCGCTTTGA GGTTGCAGAT AAAACTGACA CGGTTTGAAG AGATTTTCGA 120 

AGAGTATTAA TTTACATAAA TAGCCAGTGT TTGATAGGGT TTGAGTAGAA TTTTCTCAGA 180

CACTTCTGCA TCTTCATAGT TTGATATCAA AATCTGTCCA TTTTGGTAGA CTGCTGGCAA 240

GTCGATTTCA CTTCTTTAGC ATAAAAGTTA TTGAGCACTA GTAACTTTTG ATCCTCAAAC 300

TGGCGTTCAA AAGCGTAGAC TTGTTTGCTA TCTTCAAAGG CTGGTTTGTA ACTTCCTTCT 360

GAAATGATTG GCATTTCCTT ACGCATCGAA TCAAGTCTTG ATAGAAGGTA AAAATCGGAC 420

CCTGGATTTC ATTTTCTACA TTGATGTATT TATAGGATTT ACCAGCTTTC AACCAAGGAG 480

TGCCTGTTGA AAATCCTGCA TTTTCCGAAG CATCCCACTG CATGGGAATG CGTGAATTAT 540

CACGCGACTT AGCTTGAATA ATCTGGAAGG CTTCTTGCTG ACTCTTTCCT TCTTCTAAGA 600

GCATCTGATA GGCATTAAGC GATTCGACAT CCACATAATC AGCCATAGAA TCATAGTCTG 660

GGTCAATCAT CCCGATTTCC TCACCCATGT AGATATAAGG TGTCCCACGT GACAGGTGAA 720

TGCTGGCTGC TAGCATGGTG GCTCCTTCCT TGCGGAAGTT TTGAATATCG ACAAAACGGT 780

TCAAGGCACG TGGTTGATCG TGATTATTCC AAAAGAGGGC ACTCCAACCG TCTTTATCAC 840

TCATTTCCTT ACCCCAACTA TGGTAAAGAC TCTTCAACTC TTCAAAATCA AAGGGAGCCA 900

AGGTCCACTT TTGTCCATCC TTATAGTCCA CCTTGAGGTG ATGAAAATTA AAGGTCATGG 960

ATAATTCCTG ACGATCAGGC GACGAATAGA GGACACAGTT TTCCATGGTG GTAGAAGACA 1020

TTTCCCCAAC TGTCATAAAG CTATCGTCGG ATCCAAAAGT GGCTTGGTTC ATCATACGCA 1080

AATAGTTATG AACGATGGGT TTGTCTGTAT AAGCTGGCTT CCCTTCATTT TCAGGACAGT 1140

CCACTGAAAC CTCGTCCTTA CCGATCAAAT TGATCACATC AAATCGGAAA CCTTTGACAC 1200

CCTTGTCGCG CCAGAAATTA ACAACCTTGA AAAGCTCCTT ACGGACATTG GAATTGCGCC 1260

AGTTAAGGTC AGCCTGGGTC TCATCAAATA GGTGAAGATA GTATTTCCCA GTATCCCCGA 1320

AAGGCGTCCA TGCAGAACCA CCAAACTTAG ACTGCCAATC TGTTGGTTGG TCTTGGATGA 1380

AGAAAAAGTC TTGATAATAC TTATCACCAG CTAGGGCTTT CTGAAACCAT TCATGCTCTG 1440

TCGAACAATG ATTAAGTACC ATGTCCAGCA TAAAGTCAAT CTTGTGCTCT TTACCGACAC 1500

ACACCATTTT CTCAAAATCA GCCATATCAC CAAAAAGAGG ATCCACTGCC ATATAATCTG 1560

AAATATCGTA ACCATTATCC CGTTGAGGGC TTGGATAGAA TGGATTGAGC CAGACCATAT 1620

CCACACCTAG TTTGGCTAAA TAGGGAATTT TTTCGATAAT CCCACGGAAA TCCCCAATAC 1680

CGTTTTCAGT GGTGTCTTTG TAAGATTTTG GATAGATTTG ATAGACTACT TTTCCTTTAT 1740

CAAGTGTCAT CTGTTTCTCC TTTTCTGATA AAAGGGAGGA AGCAGTCTTC CGTCCCTATT 1800

TGTGCTATTT CAATTATACT CAATGAAAAT CAAAGAACAA ACTAGGAAGC TAGCCACAGG 1860

TTGCTCAAAA CACTATTTTG AGGTTGCAGA TAGAGCTGAC GTGGTTTGAA GAGATTTTCG 1920

AAGAGTATTA GATTCGTGTA GCGACCATGA GAGATGCTCC AGCTTGGATC GTTGTCGGAT 1980

AAGTTCCGGG AATAGTCGCT GTATAAGCAT CTTGGTTGGT GATGATAACA GGAGTTTCTG 2040

TCACCAGACC TGCAGCCTTA ATGACATCCA TATCAAAACG AATCAGTTGC TGACCAACTG 2100 

TAACGTGATC TCCTTGGACT ACAAGACTTT CAAAACCTTT GCCATCAAGA CCTACTGTAT 2160 

CCATACCGAT GTGGATGAGC AATTCAACTC CCTCGTCAGA GACAATGCCG ATGGCATGCT 2220 

TGGTAGGGAA AAGAACCGTC ACTGTCCCAT TAACTGGAGA GGTCAACTCA CCTTGGCTTG 2280 

GTTCAATGAC TAGACCTTGC CCCATGACAC CTGATGCAAA AATAGGATCC GTCGCTTGAC 2340 

TCAATTCTTT CACTTGGCCA GTTAGTGGGC TGATAATTTC TACCGAAGTA AGTTCTACTG 2400 

GTTCATGGTT CACAAATTCT GCTTCTTCTT GAGCAACGAA TTCTGCCTGC AAGTTCGTAT 2460 

CGCCCTCTGT TTTTGTAAAG AGACCAGCCT TGCGGAAGAA GAAAGTCAAG AGCATTGGAA 2520 

CAACAATCGC AACTAGCATA GTTCCTGCAA ATGGCAGCAT GTATTGAGGT TGAATAGAGA 2580 

GAATACCTGG CAAACCACCG ATACCAATAG AAGCCGCAGT TACATTAAAA GTAACGGATA 2640 

ACATGCCTGC AAGGGCTGAA CCAGTCATCC CAGCAACAAA TGGATAAATA TATTTTACGT 2700 

TAACCCCAAA AAGAGCTGGT TCTGTAACAC CGAGATAGGC TGAAATGGTT GCAGGAAGTG 2760 

AAACCTGAGC CTCACGCTCA TCATGGCGAT GCATGAAATA ATAGGCAAAC ACGGCTGAGC 2820 

CTTGAGCAAT ATTAGAAAGA GCAATCATTG GCCATAGGGC AGTGCCACCA GCATCCGCAA 2880 

TCAATTGTGT ATCAATGGCA TTGGTCATAT GGTGCAGACC TGTGATGACA AATGGAGCGT 2940 

AGAGGGCGCC AAAAATTGCA CCGAAGAGCC ATTTAACTGG ACCAGTTAAA CCTGCCAAGA 3000 

CAACTGATGA AAGTCCTTGT CCAATTGTCC AACCGATTGG TCCCAAAACA GTATGAGCCA 3060 

AAATCAAGGC TGGAATCAAT GACAAGAAAG GTACAAAAAT CATAGAAATG ACTTCTGGGA 3120 

TATGCTTGTG CCAGAAGATT TCAAGATAAG ACAGACTCAA ACCTGCAAGC AAGGCTGGGA 3180 

TAACTTGGGC TTGGTAACCG ATACGATTAA CAGTAAAATA GCCAAAATTC CAAACCCAGT 3240 

TTGCCGCGAT ATCAGCTGCT GGCGTTGAAG CAACCGCATA GGCATTGAGC AACTGAGGCG 3300 

ATACCAAACA GATTCCGAGA ACAATTCCCA AAATTTGGCT GGTTCCCATC TTACGAGAAA 3360 

CAGACCAAGT AATCCCTACT GGTAAGAACT GGAAGATAGC TTCACCAGGC AACCAGAGGA 3420 

AGTGATTGAC ACCTGCCCAA AACTGAGAGG ATTCTGTGAT GGTCTTGCCA TCCAACATCG 3480 

ACCAATGGAC ACCTTCCAAG ACATTACGGA AACCGAGGAT CAATCCTCCG ACTATCAAGG 3540 

CTGGAATAAT CGGAGTAAAA ATCTCCGCCA GAGTGGTCAT AACACCTTGG ACCACGTTTT 3600

GATTACTCTT AGCTGCAGAC TTGGCTGCTT CTTTGGAAAC ACCCTCAATA CCTGAAACGG 3660

CTGTAAAATC ATTATAAAAG ATGGGCACGT CATTTCCAAT GATTACCTGA AATTGACCTG 3720

CATTTGTAAA GGTTCCTTTA ACAGCTGGAA TTGACTCGAT AGCTTTAACA TTAGCCTTCT 3780

TATCATCTCC TAAAACAAAC CGCATCCGTG TCGCACAGTG AGTTACGGCA GTCACATTTT 3840 

CTTTGCCTCC GATTGCCTGA AGCAGATCTT TGGCTTCTTG TTCAAATTTT CCCGG 3895 



(2) INFORMATION FOR SEQ ID NO: 202: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3936 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 202: 



AGGATCGCCG CTCCAGCTAC TAAGTCTCGT GCAGTGCCGA TTTATCAAAC AACATTTTTT 60

GTTTTTGATG ACACGTAGGA AGGTGCCGAT CTGTTTGCCT TGAGGAAACC AGGGAACATT 120

TATACTCGTA TCACCAATCC TACAACAGCT GCCCTTGAAG GTGGTGTTGA AGCGCTAgcA 180

ACAGCATCAG GTATGACTGC AGTGACTTAT ACGATTTTGG CGATTGCCCA TGCTGGTGAC 240

CATGTAGTGG CTGCTTCGAC TATTTACGGT GGAACCTTCA ATCTTTTGAA AGAACCCCTT 300

CCTCGTTATG GTATCACAAC AACCTTTTTC GATATTGATA ATTTGGAGGA AGTAGAAGCA 360

GCTATCAAAG ACAATACCAA GCTTGTCTTG ATTGAAACCT TGGGTAACCC CTTGATTAAT 420

ATTCCAGACC TGGAAAAACT GGCAGAGATT GCTCATAAAC ATCAAATCCC ACTTGTGTCA 480

GACAATACTT TTGCAACACC TTATTTGATT AACGTCTTCT CTCATGGCGT TGACATTGCC 540

ATTCACTCTG TGACTAAGTT TATCGGTGGG CATGGTACAA CTATTGGAGG AATAATTGTC 600

GATAGTGGTC GTTTTGACTG GACGGCTTCA GGGAAATTCC CTCAATTTGT TGACGAGGGT 660

CCAAGCTGCC ACAATTTGAG CTATACTCGT GATGTGGGTG CAGCAGCCTT TATTATAGCT 720

GTTCGAGTTC AATTGCTTCG TGATACAGGT GCAGCCTTGT CACCATTCAA TGCTTTCCTC 780

TTGCTACAAA GACTTGAAAC CTCTTCACTT CGTGTGGAAC GCCATGTACA AAATGCTGAG 840

ACAATTGTTG ATTTTCTTGT CAACCATCCT AAGGTAGAAA AGGTAAATTA TCCAAAACTT 900

GCAGATAGTC CTTATCATGC CTTGGCTGAG AAATACTTGC CAAAAGGTGT CGGTTCAATC 960

TTTACCTTCC ACGTCAAAGG TGGCGAGGAA GAAGCACGCA AGGTCATTGA TAATTTAGAA 1020

ATCTTTTCTG ACCTTGCAAA CGCGGCAGAT GCTAAATCGC TTGTTGTCCA TCCAGCAACA 1080

ACCACTCACG GTCAATTGTC AGAAAAAGAC CTAGAAGCAG CAGGTGTCAC ACCAAACTAA 1140

ATTCGTTTGT CAATCGGTCT TGAAAATGTA GAAGATTTGA TTGAAGACTT GCGCTTGGCC 1200

TTGGAAAAAA TTTAAAGTAA AAGAAGATAA ACAGTGGGCT TCGACTCACT GTTTTTGATT 1260

TTCCCTCAGG CATGATATAA TGGTTACAGA AGTCTAGAAA GAGGAACGAT ATGAACGAAA 1320

TCAAATGTCC CAACTGTGGG GAAGTCTTTA CAGTAAATGA GAGTCAGTAT GCCGAACTCT 1380 

TGTCCCAAGT GAGAACGGCA GAGTTTGATA AGGAACTACA CGATAGGATG AAGCAGGAAC 1440 

TGGCCTTGGC TGAGCAAAAG GCCATGAATG AGCAACAGAC TAAACTGGCT CAGAAGGATC 1500 

AAGAAATTGC GCAATTACAG AGTCAGATCC AAAACTTTGA TACAGAAAAA GAATTGGCCA 1560 

AGAAAGAGGT TGAACAGACA AGCCATGAGG CTCTCTTGGC TAAGGACAAG GAAGTACAGC 1620 

TCTTAGAAAA TCAGTTGGCT ACCTTGCGTT TGGAGCATGA AAATCAACTA CAAAAGACCC 1680 

TTTCTGACCT AGAAAAAGAA CGGGATCAGG TTAAAAACCA ACTACTTTTG CAGGAAAAGG 1740 

AAAATGAATT ATCTTTGGCT TCTGTTAAGC AAAACTACGA AGCCCAGCTC AAGGCAGCTA 1800 

GTGAACAAGT CGAGTTTTAT AAGAATTTTA AGGCTCAACA ATCTACAAAA GCGATTGGGG 1860

AAAGCCTAGA ACAGTATGCA GAGAGTGAGT TTAACAAGGT TCGTAGTTTC GCCTTTCCAA 1920 

ATGCTTACTT TGAGAAGGAT AACAAGGTCT CTTCGCGTGG GTCTAAAGGG GACTTTATCT 1980 

TCCGTGAGTG TGATGAAAAT GGAGTTGAAA TCATTTCTAT CATGTTTGAG ATGAAAAACG 2040 

AAGCGGACGG AACAGAGAAG AAGCACAAGA ATGCAGATTT TTACAAGGAA TTGGACAAGG 2100 

ACCGTCGGGA GAAGAACTGT GAGTATGCCG TTTTGGTGAC CATGCTTGAG GCTGATAATG 2160 

ACTACTTTAA CACAGGGATT GTTGACGTCA GTCACGAGTA TGAAAAAATG TATGTTGTTC 2220 

GTCCTCAATT CTTTATCCAA TTGATTGGTC TCTTACGTAA TGCGGCGCTA AATTCCCTAA 2280 

AATACAAGCA GGAGTTGGCC TTGGTTCGCG AGCAAAATAT TGACATTACG CATTTTGAGG 2340 

AAGATTTGGA TGCCTTTAAG CTAGCTTTTG CTAAGAACTA TAATTCAGCT TCGACTAACT 2400 

TTGGAAAAGC TATTGATGAA ATCGACAAGG CCATCAAACG CATGGAAGAG GTTAAGAAAT 2460 

TCCTGACCAC ATCTGAAAAC CAACTCCGTT TAGCTAACAA CAAATTGGAA GATGTCTCTG 2520 

TTAAAAAATT GACCCGGAAA AATCCAACAA TGAAAGCGAA GTTCGAAGCA CTGAAGGGGG 2580 

AGTAGAAAGC AAAAATGAAC GGTATTATTA ACTTAAAAAA GGAAGCAGGA ATGACCTCGC 2640 

ATGATGCGGT TTTTAAACTG CGTAAGATTT TGGGAACCAA GAAAATTGGT CATGGTGGAA 2700 

CCTTGGATCC GGATGTGGTG GGTGTTTTGC CGATTGCGGT TGGCAAGGCG ACACGCATGG 2760 

TCGAGTTTAT GCAGGACGAG GGTAAGATCT ATGAGGGGGA AATCACTCTG GGCTATTCCA 2820 

CGAAGACTGA GGATGCTAGT GGGGAAGTGG TCGCAGAAAC CCCTGTTTTG TCTCTCTTGG 2880 

ATGAAAAGCT TGTTGATGAA GCGATTGCTA GCTTGACTGG GCCTATTACT CAGATTCCCC 2940

CTATGTATTC GGCAGTTAAG GTTAATGGTC GCAAGCTCTA TGAGTATGCG CGTGCTGGTC 3000

AGGAAGTGGA GCGTCCAGAA CGTCAGGTGA CCATTTATCA ATTTGAGCGA ACAAGTCCGA 3060

TTTCTTATGA TGGCCAACTT GCCCGATTCA CTTTTCGTGT AAAATGCAGT AAAGGGACGT 3120 

ACATCCGTAC TTTGTCAGTT GATTTGGGTG AAAAGCTTGG TTATGCGGCT CATATGTCCC 3180 

ATTTGACTCG TACTAGTGCT GCTGGCTTAC AATTAGAAGA CGCTCTTGCC TTGGAGGAAA 3240 

TTGCTGAAAA AGTAGAGGCT GGGCAATTAG ATTTTCTCCA TCCTTTAGAG ATTGGGACAG 3300

GTGACCTTGT CAAAGTTTTC CTAAGTCCAG AAGAGGCTAC AGAAGTTCGC TTTGGTCGTT 3360 

TTATTGAGCT AGACCAAACG GACAAAGAAC TGGCTGCCTT TGAAGATGAT AAATTGTTAG 3420 

CCATTCTAGA AAAACGGGGC AATCTCTATA AGCCAAGGAA GGTTTTTAGC TAGATCGTTT 3480 

AGGAATAAAA ATCGGGTGAT AGATAACAAT TGCTTGATAA AACCCCATAC TAATAGTAGA 3540 

ATGGTTTTGG GAATTATAAT ATTCCAATTG TTGCGAGTTG TAGGTACTCA AATAATCTAT 3600 

ATAGAAATTT AGAGGTGTGA AATGAAGCAA TTTAAAATTC TTTCAGATAA ATATTTAGAG 3660

TCCATTACAG GTTCTGATGG GAACTTAGGC CCAGGATTTG GTGTGATAAT TCCATGATGC 3720 

GAAATGAGTT TCGAGAAAGG GTGGAGCAAC TTCTTCAACA AAAAGAAATA AATGAAAATA 3780 

GTGAGTTGAG TCACCTGTTT CGTCTTGCTA TACAAAATTT AGACAGAAAT GAAAAATACC 3840 

AATCGGTCAT GGCCAATTTG AGTCAAGGGT TGTCACTTTA CCTCATGACG CATCATTACC 3900

AGGCACCTAA GTCTGTCATT GATTTTGGTT TATGGA 3936 
(2) INFORMATION FOR SEQ ID NO: 203: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 3230 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 203: 

CATCCAGCAA CTGCTCCTCT GAGCGTTTCA AAATTGATGT AATTTTTCTA GTTTTTTCTA 60 

ATAAATGTGC CATTTTTCAC CTCGAATTTA ATCGCTATCA TTATAACATA AAAACGTCTC 120 

TTTTTCAATA ATTATCTGAA AATTCCTTAT TGACTTGCAT TGACTTACAA TTTAATTAAA 180 

AACCAGAATA TTTTTAATTA AATTGTTCCT TTTCTATTGA CAAGTTGCCT ATTTTTGTGT 240 

ATCATAATAT TATAAAAGAT AATATAATAA TTTTATTTGT CTTTTCACAT TCGGTCTCCT 300 

TATATAAAAA AGCGATTCAT TTTGAACCGC TTTTTCTTAT TTATCGCCTT TGTTACGAAT 360 

AACAAAGCCT GTTTGCTTTT CGCTTAAAGT ATTGCGTGGT TTTTTATTAT CCTTACGGTA 420 

ACGTTTTTCC TTATCAAAAC GATCGTTGCC ACGACTTCCT TTTTTGAACT CATCACGGCG 480

ACCATTGCCA CGGCGATCAC GCTCTCGACG GTCGTCCCCA CGACGGCCTC CACGACCTCC 540

CTTAGCTTTA CCACCGAAAC CATTACCTGA TGGTTTAAAC GGTAGTGGtT TTTCACGTGC 600 

AATCTCCACT TCTGGAAGGC TATCTGGGTC TTGGACTGTC AGACTCAAGA TATACATTGC 660 

CAATTCTTCT GGAGTAAACT CAGCAGCCAA TTTGCGAGCA TCCTTACCAA ATTTCTCAAA 720 

GTTGGCACGA ATGGTTTCAT CTGCAAAATC ACGTTCGATT TTCTTGAGAG CTACCTGTTT 780 

TTTTGATTGG AAGGATTCTT CTACACTTGC AGGTTTGAGA CCTTTCATGC GTTTCTTAGT 840 

CAAGTTTTCA ATGATTTGAA GGTAACCCAT TTCGTTTGGA GCAACAAAAG TAATAGATTG 900 

ACCTGACTTA CCAGCACGAC CTGTACGACC GATACGGTGA ACATAACTCT CAGGATCTTG 960 

TGGAATATCG TAGTTGTAGA CATGGGTCAC ACCTGAAATA TCCAAACCAC GCGCTGCAAC 1020 

GTCTGTCGCA ACCAAAACAT CAAGATTGCC ATTTTTAAAG TCACGAAGGA CACGAAGACG 1080 

TTTGTTTTGG TCTAGGTCGC CATGAATTCC TTCTGCACGG AAGCCACGAA TTTTCAAACC 1140 

ACGAGTCAAT TCATCCACAC GGCGTTTGGT ACGACCAAAT ACAATAGCGA GTTCTGGTTG 1200 

TGCCACATCC ATGAGACGAG TCATGGTGTC AAATTTTTCT TGTTCCTTAA CACGGATATA 1260 

GTACTGGTCA ACCAATTCTG TTGTCAATTC CTTAGCCGCA ATCTTGACAT GTTCAGGGGC 1320 

TTTCATAAAC TGAACACCGA TACGTTTGAT GGCATCTGGC ATAGTTGCTG AGAAAAGCAA 1380 

AGTTTGACGG TTCTCAGGTA CACGGGAAAT AATGGCTTCG ATGTCTTCAA GGAAGCCCAT 1440 

GTTAAGCATT TCATCCGCTT CGTCAAGGAT AAGGGTTTCA ATGTCTTGTA ATTTCAAGGC 1500 

CTTGCGTTTA ATCAAGTCCA AGAGGCGACC TGGAGTTCCC ACCACAATAT GGGCACCAGA 1560 

TTTAAGAGCC TTAATTTGTT TTTCAATGCT TGATCCGCCA TATACTGAAC GGACTTTGAC 1620 

TCCCTTACTA CGACCAAAGC GGAAGAGTTC TTCTTGACTT TGGACAGCTA GTTCACGAGT 1680 

TGGAGCGATG ACCAAGGCTT GGATAGTCGC TTCTTCTGTA CGGATTTTTT CAAGGGTAGG 1740 

CAAGCCAAAG GCTGCAGTTT TTCCTGTACC AGTCTGAGCT TGACCGATAA CATCCTTGOC 1800 

TTCAAGGGCC AAAGGAATAG TTTGTTCTTG GATAGGACTA GCTTCTACAA AACCAGCTTT 1860 

TTCAATTTCT GCTAGCAAAT CAGCAGACAA GTTTAATTCA TTAAATTTCA CGTTATTCTT 1920 

CTTTCTAAAG GTGGTGCGAA GCCACCCTAT AGGGCTTAGT TTATACTTTT CTTTTTATGA 1980 

CGTATTTTCA TATAACTAGA TATAAAATCG TGTTGCTTCT TTTCCACAAA AGAAAAGTAC 2040 

TGTTTTCTTT GCAACCTATC TAGTATAACA CAAGACCAGA GCAAAAGATA GCCCCATTTC 2100 

TACAGAAAAT CATGTAAGCG CTTTTTGACT TTCTTTTTTG ATTGAACGAC CTAGATAATA 2160 

AGACAAAGCC AAGGCGATAC TGTATAAAAT GAGAAAAACG AACAAGGTTT GTGTGTACGA 2220 



ATGAGCCATT TTATAAGTCT CTGCTAATAA AATAGGTCCC GCTAAACCAG CCATTGCCCA 2280 

AGCTGTTAAA ATATAACCAT GCAGAGCGGC CAATTCCTTG GTTCCAAAAA TATCACTGAG 2340 

ATAAGCTGGA ATCAAAGAAA AACCAGCTCC ATAGCAAGTC ATCAAAATAG ACATAGCAAC 2400 

TACAAATAAA ACGGAATCTG TAAAGAGCCA AAGTGAGAGA GAAAAGAAAA GATTGACAAG 2460 

CAGTAATATA CTAAAGGTTA GAGGGCGACC GATATAGTCA GACAAACTCG CCCAGAGCAA 2520 

GCGACCAAAT CCATTGAAAA TCCCCAAAAC ACCCACCATT ACTGCTGCAT GACTTGTAGA 2580 

CAAGCCAGCC ATCTCCTGTG CCATTGGCGA TGCCGCTGAA ATTAAGCCTA AACCACAAGC 2640 

TATGTTGATA AAGAAAATAA TCCAAAGCAT ATAAAACCGA TTGCTTTTTA GAGCCTGATT 2700 

TGCAGCCATT CCTTGCGTCA AAGAGGCTGT TTTTTCTTTC CCTGAAGAAG ATAAAATTGC 2760 

AAGCTCTTGC TCATTTGGAC GCTTAATGAA TTGTGAAGCT AGGAGCATGA TAATAAAGTA 2820 

ACTTGCTCCT AAAATATAAA AAGTTTCTAC AAGCCCTACC CCTGCGATGA GGTGTTGCGC 2880 

TATGGGACTA GTCAATAAAG AAGCAAAACC AAACCCCATA ATCGCTAAAC CTGTTGCGAG 2940 

ACCACGTTTA TCAGGAAACC ATTTTATAAT CGTCGACACA GGGGTAATAT AGCCTGCTCC 3000 

CAAACCAAGC CCACCTAAAA TGCCATAAGC GAGATACAAC AACCACAGCT CTGACGGTCT 3060 

ATTGCAAATC CTGTTAAGAT ATTTCCACCT GCGTATAGAA AAGCAGATAG ACTTCCCATG 3120 

ACTTTCGGAC CAAATTTTTC TACCAAACGC CCCATAAATG CAGCCGATAA GCCCAAACAA 3180 

AAGATTGCTA GACTAAAGGC GAAGGCAACA GAAGCCTGAT CCCATCCCGT 3230 



(2) INFORMATION FOR SEQ ID NO: 204: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5096 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 204: 

CCTATGAAGA CTGTCCCAAC TGGGTGTCCT TCTAGGCTAT CTGGTCCTGC CACTCCAGTC 60 

AAACTAATTC CAAAATCAGA CTGGGTCTTG CTTCGTGCCT GCTCAGCCAT CTTCTGAGCT 120 

GTAAATTCAG ACACCACACC ATGTTCTTCC AAATTCTTGG CAGGAATATC CAACATCCTT 180 

GATTTTTCCT CCAAGCTATA GGTCACAAAA CCACCCTTAA ATATACTTGA AACTCCAGAA 240 

AAATTCGCCA CGGTAGCTTG GAAAAGACCT GCCGTCAAAC TCTCTGCAGC CGCGATGGTT 300 

TTCCCTTGCC TTTTCAGTTC TTCTACCACA ATGCTGGCTA AACTAGTTTC TTCCCCATAA 360 

CCATAGCAAA AGTCTCGTAA AGAAATTCCT TCGAAAGTCT GGCAGTCCAA GATTTGATTT 420 
TCCAAGATAT CCAGCGCTTG ATTCGCCTCT TCTTGACTGC TAGCCTTTGT TGACAGACGT 480 

AGAGTGACTT CTCCTGTCTT GGCATAAGGG GCCAAGGTAG GATCGATCTG ATTATCAATT 540 

AAATCAGCCA AAATCGTAAC CAACTGGCTC TCGCCAATCC CAAAGAAACG AAGAACTCGG 600 

GAATACAGCT TGCTCCCTGT CATCAACTTG GGTAGAAGTT GGTTTAAGAC CATGGGTTTC 660 

AATTCACTTG GCGGACCTGG AAGGACGACA TAGGTCACTC CGTCTACTTC TAATTTTCCT 720 

CCAACAGCCA GTCCTGTTTC GTTTGGCAGT GGAATCGCTC CTTCTACAAT TTGAGCTTGT 780 

CTTTCGTTAT TCGGTGTTCG GGCATAGTCT GGTCGCAGGG TAAAAAAGAT ATCCAACTTC 840 

TCCTGAGCCT GAGGATCAAA GACTAATGCT TTCCCTAAAA ATTTAGCTAG GGTTTGTTTG 900 

GTTAGGTCGT CCTCAGTTGG CCCCAAACCG CCTGTCAAAA TCACCAGACT GCTACGTTGA 960 

CTGGCAATCT CAAGCAAAGA CAAGAGACGA ACTTCATTGT CTCCTACAGC 1020 

TATACATCTA CCCCAATCTC AGCTAGTTTT TCCGACAAAA ACTGGGCATT GGTGTTGACA 1080 

ATCTGCCCTG TCAAAATCTC TGTTCCAACA GCAATGATTT CTGCTTTCAT GTTTCCTCCT 1140 

ACCTATCTAT TCGTATTTTT TTGAAAAAAT CGCAGGAATT TTCCTACGAT TGATTTTTTT 1200 

ATTTGTATCA AAAGTTAATT ATCTTCATCA CCAACAGGTG CTCTGCCAAA TAAATCTTCA 1260 

AATAAAACCG CATTGGTTTC AAGCTGAGTA ACTTCTTCTT GTCCCAAAGA ACGTCGGAGT 1320 

AGATTTTGCA TTTCCAACAT ATGTGCTCTC GAAACAATCT GGTAAGAAAC ACCTTGAAGT 1380 

ATCTCTCCTT CACCCTGCAA CTGCTGAGTT TCAATGGTTT TAAATGAATC TTTATAGCCT 1440 

AGCAAGTTAG GGATACTTTT TGCAGACAAA TCAATATTGG TCTGCATATT GTCACTCAAA 1500 

GCTTTTAGAA TCTCTTGATA ATGACCAATG CTATTTAAAC TGAGAGCTTT TTCCATGACT 1560 

TTTTGAATAA CTTCACGTTG ACGTTTTTGA CGACCATAAT CCCCCTCAGG ATCTTGGTAA 1620 

CGCATTCGTG CATAGACTAG GGCTTCTTCT t-v_CCCAATAT GTTGCTCCCC AACACCGATA 1680 

GAAATAGTAT TAAATTCTTC TTGGTCACTG ATAGAAATTG GGAAACCTAG GATATTATTG 1740 

ACTGTAATAC CTCCTACTGC ATCCACTAGT TTTTGCAATC CTCTCATATT GACCATCACA 1800 

TAGCGATCAA TATGGATATT CATCATTTTT TGAATGGTTT CTATAGCAAG CTCTGCTCCA 1860 

CCATCTGCAT ATGCTGAGTT CAGTTTCGCT TCATGAGCCT GACCATTCCC TGATTCAATG 1920 

CGCGTCAGAA TATCCCGCTC TAAACTCATC ATTGTTGTTT TTTTCGTTTT AGGATTCACT 1980 

GTCATCAAGA TCATGCTATC ACTTCTACCG ACCCAAGTTT CAGTTCGTTC AACATTTCCG 2040 

GTGTCCACTC CCATTAACAG AATGGTTAGA GGTTCAGTCG CTTCAATAAC CTTGGTTTCT 2100 

TCACCGATTT TTTTATAGGT TTTAGCTAAG GTTTCTGTCC CTTGTTGATA AATAGTATAA 2160 

GCAAAAACAC CTACTCCTAC TACAGTTACA GAAAGTAAAG CTAGCACCAT TCCAATAATT 2220 

TTTTTAACCA TATTTCTACT AACCTATCAG TTTACCCATC AAGTAAACAT CGATAAATTT 2280 

CCCTTCTTCT ATATATGCCC CACGCTCTTG GCTACCTTCA ATGACAAAGC CATGCTTTTG 2340 

ATAAAGATGG ACTGCTGCTT GATTACGAGT TTGGACAGTC AGTTGGAGAC GACGCAGAAT 2400 

GCCACTTGCT TGTGCCCACT CTATCGCTTC TTCTAGCAAC AAACTTCCCA AGCCATTATT 2460 

CCAATATCTT TTTCCAATCA CAATGAAGAG ATCTCCAATA TGACGGACTC TCTTACGCTG 2520 

ATCAGCTGTA ATATTTACAA TACCAGCAAT TTTGCCATTT AAGAATGCAA GTAAGGTTAT 2580 

CTGATTGTCC GAACTAGCTT GCTTGTTGAG GAATATTTCC ATCTCCTCAC TAGTCAAGAG 2640 

AATACCATCT CCGTCTAGGC TGGTAAAGTC TGTCTCCAAA CTCACACGAT TTAAAAAGGC 2700 

CACTAATTCA GCTGCATCTT TGGGCTCTGC TTCCCTAATG AGCAATTCAT ACTCCATATT 2760 

GAAGCTCCTC TAACAATTTC TCAGCACGCA AACCCTTTGC CTGAAAATTT AAACGGCGTC 2820 

CATCTGCTTC TTTTAGAATT TCCAATTCTA AATAAGCATC TGGCAAGGCA TCTCCTAAGA 2880 

GATTTCCCCA CTCAATAACA GTCACGCCGC CACCAAAGAT AAACTCATCC AAGTCGATAG 2940 

AATCAGCATC TCCTTCAATA CGATAAACAT CTAGGTGATA AAGTGGAAGT CGACCTTCAT 3000 

ACTCTCTCAC GATAGTATAG GTGGGACTTT TAATCATTTG AGAAATCTGT AATCCTTTTG 3060 

CAAGTCCTTT AGTAAAGGTC GTTTTACCTG CACCCAGTTC TCCAGTTAAG ATTAAAACAT 3120 

CATTCTTTGC TAATAGATGG CCCAAACGCT CCCCTAAGGC TTGCAACTCT TCTTCATTTT 3180 

TTGTGTACAT ACTCTTATTA TACCAAAAAC TTTTCTTTTG TGTCTATTTT CCTACTAAAC 3240 

TTATCATCAT AACATCCATA AAAAACAGGC TTTCTCTAAA AGAAAATGAG CGTAACAATG 3300 

ACCAATACAA GATCTCGGAA AATATGACCA TAAAAGGAAA CTTCCTTCTT AACCGAATTT 3360 

GGGACAAGAT AGGCTGCAAA AAACAAGCCC AGTCCAATAT AAATCAGAAG TGAGACAATG 3420 

GTCATTGGAT TTCTTAAGAA AAGAAGTGTT GCTAAAATAG TCACCAACAC TGTCTTTTTT 3480 

CTGTCCAGCA TAGCAAGAAA ATCGCGCACG TATTTTTTCA AGGGTAAAAA AATCAGCAAA 3540 

TCTAGCCCAA ATAGGAAAAA GAAGGATGGC AATAAAAAGT CAACTAATTC TTGCTGCAGC 3600 

GTATTTTTGA TGAACAAGTT ATCTGACAAA ACAAGAACAG CTCCTAACAA ATTAATTAAG 3660 

AGTAACATAC TGTAAAAAAG CTTCACCGAC TTCTTACTGG CTAGGACACT ATGGACTTCT 3720 

TGCTTACGGG TATAAAGATA ATTTACTCCA GCACAGATTC CTGAAACGAA AACCATGCTT 3780 

CCGATGAAAA AAGCTGTACT TTGTTTAAAG GACAAGATGC ATTCCTTCCA TAGGAAACAG 3840 

CTACTCAAAC TGATTTGAAT TAAAGCTAAC AAAAATAAGA TTCTCATTGA TTTCATCTTC 3900 

TCTCTCCCTT CCTACCAATC ATTATACTAG GAGAAAAGAG AGAACTGTTT CTAATCTTCT 3960 
CAAATGTCTC TTTAAGACGC TAAACAAACA CTAGAGACTA ATACTCAATG AAAATCAAAG 4020 

ATCAAACTAG GTAGCTAGCC ACAGGTTGCT CAAAACAGTG TTTTGAGATT GCAGATAGAG 4080 

CTGACGTGAT TTGAAGAGAT TTTCGAAGAA TATAAATTTG AAATCATGAA AATCCGTCAA 4140 

ACGGGTGGTT GTTTTGTCTC GCACCTCACG GAGCGAGACG GACTCAGAGT CACATAATTA 4200 

TAAGGCTGAT AGTATTAATC TAACTATCAG CtTmCAGGTT ATTTAACGTT TCAGAAAAAC 4260 

TATAATGTCA AGATTAACTA AACAGTATCT AGTTCCTTCA AATAATTTTC TATCTTCATC 4320 

AACATTAAAG GATTGTTATA AATCTTACAT AACTCTCTTG CTTCTATATA ATAATTTTTG 4380 

ACTTGTTCTC TGTCTAGAAA TTTGGCTCCA GCATTTCCTA CAAGAATAAG TAGAGGAGCC 4440 

AATTGGTAGC TTGTCTGTCT TTGTTTACAG AGTTCAATCG TTTCAAGAGC TTCTTGGATG 4500 

GCTTCATTAT ATTTTTCCTT TGATACTAGG TAGTGAGCGT AGTTGTAACG AACTCTGATG 4560 

TAGCCAAATA AAAACTCTTG ATGGTCCAAA TTTTTTGTCT GATACAACTC TATTAAATGA 4620 

GAGTAGTTTG CCTCATATTC TTGTTCACGA CCCACTAAGG AATAGAAATT AGATAGAGTA 4680 

TTCAACGCCT TTAAATAAAT CAGAGTATTT GAAGAGACTT TTAATAATAT ATTTTCCAAT 4740 

GACGAAATTG CCTCACACTT ACTGTCATAT TGATAGAAGT CAATTATAGA TTTAATCCAT 4800 

TCAAGGTAAG TTCGGTCTTC TAATGTTAGA AAAGTGCTTC GTTCTACTTC TATTTTATAA 4860 

AGATATTCTA AATCGTCATA ATTTCTGTCA TCTAATAGGC GAGCAGATAG ATGTTTGAAA 4920 

TTAGAGAGGT TAGACTTAAC TTCGATTTGT TCATTGAAAA AGTAATCCAA AGGGACTTCA 4980 

AGTCGTTGAG AGAGTTTGAA TAACAAGTCT GCGGAGGGAA TAAAATGACC TCTTTCAATT 5040 

TTACTAATCT GGCTTTGTTC ACAAATTCCT TCTGCAAGAG TTTGTTGGGA GAGTCT 5096 
(2) INFORMATION FOR SEQ ID NO: 205: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2395 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 205: 

ACAAGATAAA AATAAAGGAT TACAATGGGG AATATAAAGT AAACCGGTAA ACCTAAAAAG 60 

AAAGGAGAAA AGATGAAAAT TGTACTTGTA GGGCATGGAC ATTTTGCTAC AGGGATTTAT 120 

AGTTCTTTAC AATTGATTGC AGGTAATCAA GAAAATGTGG AGGCGATTGA CTTTGTGGAA 180 

GGAATGTCAG CAGATGAACT CAAGCAAAAA ATCTTACTTG CAATTTCAAA TGAAGAAGAA 240 
GTTTTAATCC TAAGTGATCT CTTGGGAGGA TCGCCATTCA AGGTTTCTTC TACCATAATG 300 

GGAGAAAATC CAGCCAAGAC AATGAATGTT CTCTCGGGTT TGAACTTAGC CATGTTAATG 360 

GAAGCAGTCT TTGCTAGAAT GGCTCATAGC TTTGATGAGG TTGTTAATAA ATCAGTAGTG 420 

GCGGCCCAGG GCGGAGTCGT AAATGGTAAA GAATTGTTTT CAACGGATGC AGAGGAAGAG 480 

GAAGAAGATT TCGAATCGGG TATTTAAAGG GTAAAAGAAT GATAAAAAAG GTTACGATTG 540 

AAAAAATAAA ATCGCCTGAG CGCTTCTTAG AAGTACCACT TCTGACGAAA GAAGAAGTCG 600 

GCCAGGCAAT CGATAAGGTT ATTCGGCAGT TAGAACTCAA CCTTGACTAT TTCAAGGAAG 660 

ATTTCCCGAC GCCAGCTACC TTTGATAATG TCTATCCAAT CATGGATAAC ACGGAATGGA 720 

CCAATGGTTT CTGGACAGGA GAACTGTGGT TGGCTTATGA ATACAGTCAA CAGGATGCAT 780 

TTAAAAACAT CGCTCATAAA AATGTTCTTT CTTTCCTGGA TCGTGTCAAT AAGAGAGTAG 840 

AATTGGATCA CCATGATCTC GGCTTCTTGT ACACACCGTC TTGTATGGCT GAATATAAGA 900 

TAAATGGAGA TGGAGAGGCT AGAGAAGCAA CCTTGAAAGC TGCAGATAAG TTGATTGAAC 960 

GCTATCAAGA AAAAGGTGGT TTTATTCAAG CTTGGGGAGA CTTGGGCAAG AAAGAGCATT 1020 

ACCGTTTGAT TATCGACTGC TTGCTCAATA TCCAACTCTT ATTCTTTGCT TATCAAGAAA 1080 

CAGGCGATCA AAAATACTAC GATATTGCAG AAAGCCATTT CTATGCTTCA GCTAATAATG 1140 

TAATCCGTGA TGACGCTTCG TCCTTCCACA CCTTCTATTT TGATCCTGAG ACAGGTCAAC 1200 

CCTTTAAAGG TGTAACGAGA CAAGGGTATA GTGATGATTC ATGCTGGGCA CGTGGTCAAT 1260 

cATGGGGAGT CTATGGTATT CCTTTGACTT ATCGTCACTT AAAAGACGAG tCCTGCTTTG 1320 

ACTTGTTTAA GGGTGTGACC AATTATTTCT TGAATCGTCT GCCAAAAGAT CATGTGTCCT 1380 

ATTGGGATTT GATTTTTAAT GATGGTAGTG ATCAATCACG AGATTCTTCA GCAACAGCTA 1440 

TCGCCGTCTG TGGGATTCAT GAAATGCTAA AACATCTCCC AGAGGTGGAT GCTGACAAAG 1500 

ATATTTATAA ACATGCTATG CATGCCATGC TTCGTTCCTT GATCGAACAT TATGCAAATG 1560 

ATCAATTTAC CCCTGGTGGG ACAAGTCTCC TCCACGGTGT GTACTCATGG CATTCAGGTA 1620 

AAGGAGTGGA TGAAGGCAAT ATCTGGGGTG ACTACTATTA CCTAGAAGCC CTTATCCGTT 1680 

TCTACAAAGA CTGGAACCTA TATTGGTAGG AGGAGAAATA TGACAATGCC AAATATTATT 1740 

ATGACCCGTA TCGATGAACG GTTGATTCAT GGACAAGGAC AACTTTGGGT AAAATACCTA 1800 

GGTTGTAATA CGGTCATTGT TGCCAATGAC GAAGTAAGCA CGGACAAGAT GCAACAAACT 1860 

CTGATGAAAA CAGTTGTGCC AGACTCAGTT GCCATGCGTT TCTTCCCTTT GCAAAAGGTG 1920 

ATTGATATCA TTCACAAGGC TAATCCTGCT CAAACGATCT TTATCGTTGT AAAGGATGTG 1980 

AAGGACGCTT TAACCTTGGT AGAAGGTGGT GTCACTATCA AAGAAATCAA TATTGGGAAC 2040 
ATTCACAATG CCCCTGGTAA AGAGCAAGTG ACACGCTCCA TCTTCCTGGG TGAAGAGGAC 2100 

AAGGCGGCCC TCAAGGAATT GAGCCAAACT CATCAAGTAA CATTTAATAC GAAAACAACT 2160 

CCAACAGGAA ATGATGGAGC TGTTCAAGTC AACATTATGG ACTATATTTA ACAGAGGAGA 2220 

TCGTTATGTC GATTAATGTA TTTCAAGCGA TTTTAATTGG ATTATGGACA GCTTTCTGTT 2280 

TTAGTGGAAT GCTGTTAGGA ATTTACACCA ATAGATGTAT TGTTCTGTCA TTTGGTGTCG 2340 

GAATTATTCT AGGTGATCTG TCATGCTCTT GCAATGGGAG CCAATGGTGA ATTGG 2395 



(2) INFORMATION FOR SEQ ID NO: 206: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3342 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 206: 

CCTTCTTTAG AGGTTAATTT TGCAAAATCG TCGATTGTTA TATAAGGATT ATTATAGAGA 60 

CTGTTCGCAA AGAATCTCTG ATATGTTTTT GAATCTTTTG AATACAAAAC TATCTCTCTA 120 

ATAGCATTGC CATCTGTTCC ATCAATTGGT AAACATACCG TAACTAGAAA AAGAATTATA 180 

TTCAAAATAA AAAATTCTGA TGCGTACGGC ACAAATCCCA AAAGTGCTAA TATTGCGACA 240 

ATTAGGTTAG CTCCACCTCC CCCAAAGAAG TAGAACACCA AATTCCTATC ACTATTTTTT 300 

TCATTAGTAA TGTTTCTATT ACTCATTTGA CAATAACCGA ATGCTAATAA CACTGGAAAT 360 

TTGAAATATA TTTTTTTTCT GAAATAGAAG AAAAAGGGAG TAGCAAGCAT CTCTAGTTTA 420 

TAAGATAAAC ATCTTCCCAC TAAAAAATGA CCTAGTTCAT GTAATGTAAT TGATATTAAC 480 

GAAATTAAAA TCAATCGAAA ATAATAGATT AATGAATCAT TTGGAAAAAT TATCAATAAT 540 

AGGAACAATA ACGGAATCAA ACATAAATAT ATGACAGAGT TATTTAATAT TTTCAACATA 600 

ATACCATTCC TCTAAACTAT TAGCTTCAAA AAGGCGTTTT TTCTCCCAAT ACATCTTCTC 660 

AAAATGTTCG GAATCATAAT TTTCTAAAAT TAATTTTAtG TCTGGTAAGC TCTTTCTTGA 720 

TAATCCGTTG TTTTGTACTT AATTTTCCCT TCAAGTACAT CTTCAATTTT ATAAGTTGCC 780 

TCCATCAACT GAGCCTCTGC AATATCTTTG AGTGAATTGG TAATTGAAAC TTGGTGTAAT 840 

ATCTGTCCtS CCATATATGA AAATATATCT CTAAGATATT CTGACACATT ATCAGAGCCG 900 

TTACTCTCAG CAACATCTAA TGTTACAACA AACTTTCCAG CTAATCGAAA AAGATGGCTC 960 

CACCCCCCAA TCCTTTCAAT AAAGTTTTTT GTGTCCACAG ATACGTTTTG TAAATATACA 1020 
GGAGAAGAGA TAATTATAAT ATCAGACTCT AATAACTCTT TTTTTATAAC ACCTCCATCA 1080 

TCAGCATTAC TTTGCCTATC AATTCCTTTC TTAAACAACT CTTCTGAATC AGAATTAGAT 1140 

ATTTCTAGCT CTGAATTGAA AGGTGTCCTG AAAGATATAT CAACATTATT TCTACTAGAA 1200 

ATGATACTTG AAAGTCTCTT AGTATACTCT AAAGTCTTAG AGTTATGATT TCGCACTCCT 1260 

GCATATATAA ATATTTTATT CATTTTAATT CATCCTCTCA ATTTGAATTT AGTAGATTTT 1320 

TCAAGATAGT ATGGTACAAA AACAGACTTT TGTTGACTCA CATTATTACA TATGTTTTGT 1380 

ATTAAACCAA AATCAATACT ATTTTTGGAG TAATTTTGAT TTTAGTTTAA AATCATTTCT 1440 

ATAACAGTAG CATATACCTC AAGCCGTTTA GCAATTAGAA TAGAACTTTT CTTTATTATA 1500 

TTATTATCTC AACGAAAAGC TACACTATTA AAAATATTTT ATAGAATTAC ATATTAAACT 1560 

AGTCAATCTT GGTATTTTTA TATTGCTTAA TGAGTGGACA CCTCTATTTT AGAAACAAAA 1620 

CTATAAATTA AGCTAGATTT CAAGTAATGA GGGGATAACT ATCTTTTTGT CATTCTGATT 1680 

CAGTGCGATA TACCTTAAAA AAGTATAAGC AATACCAGTC ACACCTGTAT ACAAAGAAAA 1740 

ATCTGGGAAA TTGCTTGTTT GGACGATACG ATACTCTCCT TCTTTTGATT TATTCATTAC 1800 

AACACTACAC AATAAAGACT CCAATTCCAT ACTAGTATCC ATTTCTTTCA TGTAGTCGAT 1860 

GTAAAAATTT ATTATGGCCA TACTTCCATG GCAAAATGTA TCATTATCTA AACTAGCTAC 1920 

AATTCCCTCT GGAACACTTT GGGGATGATT AACTAATGTC CCAAATTCTC CACTACACCA 1980 

CTTCAAAGAA TGAATTTTGA TTTTCTCCCT AGGAACTAGT TGTAAAATTA ATTCTTTATA 2040 

TTTTTTAAGT CTTGTCACTT TATAAATATT TTTTAATGTA AAAATTACAC CTGATAGTCC 2100 

ATGGCCAAAA CTATATCCAA AATTACTATT ATCTCTCTCG CTTACATCAT TATATAGCGT 2160 

ATCACCTAAA CTTAATACTA GCCTTAGAAC ACGTTCCTTC TCTATTCCTC TCCTATAATA 2220 

TCTTACCAGT GTATTAATTA AAGGTAGAAG ACCATTAATA TAGTCAGACT TGTTTGAAAC 2280 

ACTTGCAAAA TCAGTCTTTT CAAGCTCAGT TAAAACACTC TTTATATAAT TTAAGCATGC 2340 

GAGAGTATTT GTATCGTAAT CCTCTATAAT GGATAGAACA ATGAAATATC CTATATCCCC 2400 

AGTTAAACCA AATGTGGTCT TAGATAAAGA AACAGATGGC GGAATTGCAG ATAACATTTT 2460 

ATTGTACAGT TGAGTATATG ATGATTTATC TTTCAATAAT TTTACATAGT ACATAAACAG 2520 

TAATATTCCA GCTCTACCCC TATACATATC ATTmCCCGTT TGTTCAAGAC ACCATTTAGA 2580 

ACCTTTAAAA TTAACAGGTA TACTCCAAAT TGGATATTCG TCATAAATAT TATTAATAAC 2640 

CAAAGAGTCT GCAATATTTT CTACTTCATT ATGCAGAATA GTAACTAAAC TTTCATTTGG 2700 

GAGTTTTTTT CTATTAGATA AGTTTAATTT ATATCCTTTT TTTCGCTGAT CAAAGCTTGG 2760 

AAAATAAATT TCAATGATAT CAAGTTGCTT TTCTAAATTT TCCAAATTAT TATTAGGTAA 2820 

ATATTTCATA AAATAGTCAT ATCCAGAAAA TTGATGTAGG GAAATAAAAT GATTTCCAAA 2880 

ATCATCGTAG ATTTCATTGA TATTTGTATC TGTATAAAAA ATCGGAATAT CTAATAACCT 2940 

CATTTGTTCA CATTCGCTTG CTACAATACC TTGATTAGAA AACTTATTGC TCCAGAGATT 3000 

TTCCAATGCT TTTTCTCTAT CTAACATTTC TTCATAAAAA TCAGGATGAT ATAAAAAAGA 3060 

TAGTACTGAA GCATAGCTAT TTGTGTCTCT AAAAAGTACC CTTGTCTTTA AACCATACAA 3120 

GTTTGCTTTT AATAGCATTT TAAATTCTTC TGTTTTATTT AACTCTTCAA ATATCAGATA 3180 

AAAATCCCTA AAACCTTTTT TGAAATCTTT TATATACTTA TCAAATTCTA TATCACCATC 3240 

CCGAACAGGC AGGTTTTTCC CACCTTCAAA ATCAATTTTC CCAATATCAA ACTTTACCTT 3300 

ATCAGTATTT AAATTAATTA AAACTTGACC AGGGATCCTC TA 3342 
(2) INFORMATION FOR SEQ ID NO: 207: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3454 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 207: 

GAGAAAAGAA TGTTAAAGAA AAATGATATT GTAGAAGTTG AAATTGTTGA TTTGACCCAT 60 

GAAGGGGCAG GAGTTGCCAA GGTAGATGGT TTGGTCTTTT TTGTAGAGAA TGCTTTACCG 120 

AGTGAAAAAA TTCTCATGCG TGTCCTCAAG GTCAATAAAA AGATTGGCTT TGGAAAAGTT 180 

GAAAAATACC TTGTCCAGTC ACCACACCGT AATCAAGATC TAGATTTGGC TTACCTGCGT 240 

TCAGGAATCG CGGATTTAGG ACACCTTTCT TATCCAGAAC AGCTCAAGTT TAAAACCAAG 300 

CAAGTCAAGG ACAGTCTCTA CAAGATTGCT GGAATTGCAG ATGTAGAAGT TGCTGAAACG 360 

CTTGGTATGG AACATCCAGT CAAGTATCGC AATAAGGCGC AGGTGCCCGT TCGTCGAGTG 420 

AATGGTGTCT TGGAAACAGG ATTTTTCCGT AAGAATTCGC ATAACCTCAT GCCCCTTGAA 480 

GATTTCTTTA TCCAGGATCC TGTCATTGAC CAAGTCGTAG TAGCTCTTCG AGACCTGCTC 540 

CGTCGTTTTG ATTTAAAACC TTATGACGAA AAGGAACAGT CTGGATTGAT TCGGAATCTT 600 

GTGGTGCGTC GTGGTCACTA TTCAGGACAA ATCATGGTCG TTTTGGTGAC AACTCGTCCA 660 

AAAGTTTTTC GTGTTGACCA ATTGATTGAA CAAGTTATCA AGCAGTTCCC AGAGATTGTG 720 

TCTGTCATGC AAAATATCAA CGACCAGAAT ACCAATGCGA TTTTTGGTAA GGAGTGGCGC 780 

ACTCTTTATG GTCAAGACTA TATTACGGAC CAGATGTTGG GAAATGACTT CCAAATCGCT 840 

GGCCCAGCCT TTTACCAAGT CAATACTGAA ATGGCGGAGA AACTCTATCA AACAGCCATT 900 

GACTTTGCAG AGTTAAAAAA AGATGATGTG ATTATTGATG CCTATTCTGG TATTGGAACC 960 

ATTGGTTTAT CAGTCGCCAA GCATGTCAAA GAAGTCTACG GTGTTGAACT GATTCCAGAA 1020 

GCAGTAGAGA ATAGCCAGAA GAATGCTTCT TTGAACAAGA TTACTAATGC CCACTATGTC 1080 

TGTGACACGG CTGAAAATGC CATGAAGAAA TGGCTCAAGG AAGGTATTCA ACCAACCGTT 1140 

ATCTTGGTTG ATCCTCCACG CAAGGGCTTG ACAGAAAGCT TTATCAAAGC AAGCGCCCAA 1200 

ACAGGAGCCG ATCGCATCGC CTATATCTCC TGCAATGTCG CAACCATGGC GCGTGATATT 1260 

AAACTATACC AAGAGTTGGG ATATGAATTG AAGAAAGTCC AGCCGGTGGA TCTATTTCCT 1320 

CAAACGCATC ACGTCGAGAC GGTAGCACTT TTGTCCAAAC TCGATGTCGA TAAGCACATA 1380 

AGTGTTGAAA TTGAGCTGGA TGAGATGGAT TTGACAAGTG CGGAGAGCAA AGCAACATAT 1440 

GCTCAAATCA AAGAATATGT TTGGAATAAA TTTGAATTAA AAGTTTCGAC ATTATATATT 1500 

GCACAGATAA AAAAGAAATG TGGAATAGAA TTACGAGAAC ATTACAACAA GTCTAAAAAG 1560 

GATAAACAAA TTATTCCACA GTGTACACCT GAAAAAGAAG AAGCCATCAT GGATGCTTTG 1620 

AGACACTTCA AAATGATTTA ATAGAAAAGA ATGACAGTAT ATGACTTTCT GCATTTATTA 1680 

CATTCCTACT TGGTATAGGA ACAGCTATTA TTCCTTTCTT GCAAGGTATC AATTAGAAAA 1740 

TAGGCTCAAT ATAAAGATTG ATAGGATCAT TTTTATATTT AAAGGAGCGT TGAAATGATT 1800 

GATAAAGGCA ACAAAAAATT TTAGGATAAA TTTGCTAAGT TGTATGCCTC TTTTATGAAA 1860 

AAAGATAAAG AGGTTTATGA TAAAGTTTGT GAATATCTTA GTCCTCATTT GAATAAAGAT 1920 

ATGGAGGTGC TTGAACTTGC TTGTTGGTTT CGTGTCATAA CAGTTATAGA GGCAAATAGT 1980 

TATGTAAATA TAAGGAGTTC AAGACTTCTA CCAAAGTTTA AAACTCAAAA AATAAATAGT 2040 

TGGTGTGCTG CTTACAATAT CCATTTTAAT AATGGATATT GTAAGCAGCA CCCCcAtGAA 2100 

TTTAAAGATT CTTTAAAGAG TCTTATTTTG TGATGAAAAT TTAATATGTA AATCTCAGAC 2160 

GATAGAAATT AAAAACTCTA TCGTCTTTTT TATACTCAAA ATTAGGAGGT AAAAATGGTA 2220 

AGGATAAGAG GTCCCACTTA AAACAATTTA TGGCAAAATA AGGACGGAAT AACACAACAA 2280 

ATTCTCTAAA ACAAATCACT AAATCAATGT AAGATTGAAT GAAATCAATA TTTATGCTAT 2340 

AATTAAATAA ATTTAATGAA GAAAAAAAGA GGGATATTAT GGCACTTAAC TATAAACCAT 2400 

TATGGATACA GTTAGCAAAA AAAGGACTAA AGAAAACAGA TGTAATAGCT ATGGCAGGAC 2460 

TTACAACAAA TGTTATGGCA CAAATGGGAA AGGATAAACC AATTACATTT AAGAATTTAG 2520 

AAAGAATATG TAAGGCTTTA TCTTGCACTC CTAATGATAT TATTAGTTTT GAAGATAATT 2580 

TTAGTGACGA GGAATAGAAA ATGACTTTAA GGACAGAAGA TCAAGTTAGG GATTATGCAA 2640 
GAGAAGTATA GGCTTTAATG AAGTTGAAGA AAACATCAAT CAAGGTACTG GTCAAATAAC 2700 

TACTTTTAAT CAATTAGGCT TCAAGGGATA TTCAAATAAG CCAGATGGTT GGTATTTACC 2760 

TAAAAATATG AATGATGTAG CAATAATCCT TGAAACAAAA TCAGAAGAAA GAGATATTAG 2820 

CAAACAAATT TTTATTGATG AGTTAATGAA AAATATAGAC ATAATTTAAC TAAAAATAAA 2880 

AACTAGATCC TTTTTTGAAA AAATTATATT ATTAAATTTG TAACTGTATC TATTGACAAT 2940 

GATAATTATT ATCGATACAA TAGACTTGAA ATATGTTTAA GGAGTTTTTA TGAAAaCAAA 3000 

TTTTTTCTAA TmGCTATTTT AGCTATGTGT ATAGTTTTTA GCGCTTGTTC TTCTAATTCT 3060 

GTTAAAAATG AAGAAAATAC TTCTAAAGAG CATGCGCCTG ATAAAATAGT TTTAGATCAT 3120 

GCTTTCGGTC AAACTATATT AGATAAAAAA CCTGAAAGAG TTGCAACTAT TGCTTGGGGA 3180 

AATCATGATG TAGCATTAGC TTTAGGAATA GTTCCTGTTG GATTTTCAAA AGCAAATTAC 3240 

GGTGTAAGTG CTGATAAAGG AGTTTTACCA TGGACAGAAG AAAAAATCAA AGAACTAAAT 3300 

GGTAAAGCTA ACCTATTTGA CGATTTGGAT GGACTTAACT TTGAAGCAAT ATCAAATTCT 3360 

AAACCAGATG TTATCTTAGC AGGTTATTCT GGTATAACTA AAGAAGATTA TGACACTCTA 3420 

TCAAAAATTG CTCCTGTAGC AGCATACAAA TCTG 3454 



(2) INFORMATION FOR SEQ ID NO: 208: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3752 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 208: 

CGGGAGTATA CTTAATATAA TTATAGTCTA AAAATGACTA TCAGAAAAGA GGTAAATTTA 60 

GATGAATAAG AAAAAAATGA TTTTAACAAG TCTAGCCAGC GTCGCTATCT TAGGGGCTGG 120 

TTTTGTTACG TCTCAGCCTA CTTTTGTAAG AGCAGAAGAA TCTCCACAAG TTGTCGAAAA 180 

ATCTTCATTA GAGAAGAAAT ATGAGGAAGC AAAAGCAAAA GCTGATACTG CCAAGAAAGA 240 

TTACGAAACG GCTAAAAAGA AAGCAGAAGA CGCTCAGAAA AAGTATGAAG ATGATCAGAA 300 

GAGAACTGAG GAGAAAGCTC GAAAAGAAGC AGAAGCATCT CAAAAATTGA ATGATGTGGC 360 

GCTTGTTGTT CAAAATGCAT ATAAAGAGTA CCGAGAAGTT CAAAATCAAC GTAGTAAATA 420 

TAAATCTGAC GCTGAATATC AGAAAAAATT AACAGAGGTC GACTCTAAAA TAGAGAAGGC 480 

TAGGAAAGAG CAACAGGACT TGCAAAATAA ATTTAATGAA GTAAGAGCAG TTGTAGTTCC 540 
TGAACCAAAT GCGTTGGCTG AGACTAAGAA AAAAGCAGAA GAAGCTAAAG CAGAAGAAAA 600 

AGTAGCTAAG AGAAAATATG ATTATGCAAC TCTAAAGGTA GCACTAGCGA AGAAAGAAGT 660 

AGAGGCTAAG GAACTTGAAA TTGAAAAACT TCAATATGAA ATTTCTACTT TGGAACAAGA 720 

AGTTGCTACT GCTCAACATC AAGTAGATAA TTTGAAAAAA CTTCTTGCTG GTGCGGATCC 780 

TGATGATGGC ACAGAAGTTA TAGAAGCTAA ATTAAAAAAA GGAGAAGCTG AGCTAAACGC 840 

TAAACAAGCT GAGTTAGCAA AAAAACAAAC AGAACTTGAA AAACTTCTTG ACAGCCTTGA 900 

TCCTGAAGGT AAGACTCAGG ATGAATTAGA TAAAGAAGCA GAAGAAGCTG AGTTGGATAA 960 

AAAAGCTGAT GAACTTCAAA ATAAAGTTGC TGATTTAGAA AAAGAAATTA GTAACCTTGA 1020 

AATATTACTT GGAGGGGCTG ATCCTGAAGA TGATACTGCT GCTCTTCAAA ATAAATTAGC 1080 

TGCTAAAAAA GCTGAGTTAG CAAAAAAACA AACAGAACTT GAAAAACTTC TTGACAGCCT 1140 

TGATCCTGAA GGTAAGACTC AGGATGAATT AGATAAAGAA GCAGAAGAAG CTGAGTTGGA 1200 

TAAAAAAGCT GATGAACTTC AAAATAAAGT TGCTGATTTA GAAAAAGAAA TTAGTAACCT 1260 

TGAAATATTA CTTGGAGGGG CTGATTCTGA AGATGATACT GCTGCTCTTC AAAATAAATT 1320 

AGCTACTAAA AAAGCTGAAT TGGAAAAAAC TCAAAAAGAA TTAGATGCAG CTCTTAATGA 1380 

GTTAGGCCCT GATGGAGATG AAGAAGAAAC TCCAGCGCCG GCTCCTCAAC CAGAGCAACC 1440 

AGCTCCTGCA CCAAAACCAG AGCAACCAGC TCCAGCTCCA AAACCAGAGC AACCAGCTCC 1500 

TGCACCAAAA CCAGAGCAAC CAGCTCCAGC TCCAAAACCA GAGCAACCAG CTCCAGCTCC 1560 

AAAACCAGAG CAACCAGCTA AGCCGGAGAA ACCAGCTGAA GAGCCTACTC AACCAGAAAA 1620 

ACCAGCCACT CCAAAAACAG GCTGGAAACA AGAAAACGGT ATGTGGTATT TCTACAATAC 1680 

TGATGGTTCA ATGGCAATAG GTTGGCTCCA AAACAACGGT TCATGGTACT ACCTAAACGC 1740 

TAACGGCGCT ATGGCAACAG GTTGGGTGAA AGATGGAGAT ACCTGGTACT ATCTTGAAGC 1800 

ATCAGGTGCT ATGAAAGCAA GCCAATGGTT CAAAGTATCA GATAAATGGT ACTATGTCAA 1860 

CAGGAATGGC GCTATGGCGA CAGGCTGGCT CCAATACAAT GGCTCATGGT ACTACCTCAA 1920 

CGCTAATGGT GATATGGCGA CAGGATGGCT CCAATACAAC GGTTCATGGT ATTACCTCAA 1980 

CGCTAATGGT GATATGGCGA CAGGATGGGC TAAAGTCAAC GGTTCATGGT ACTACCTAAA 2040 

CGCTAACGGT GCTATGGCTA CAGGTTGGGC TAAAGTCAAC GGTTCATGGT ACTACCTAAA 2100 

CGCTAACGGT TCAATGGCAA CAGGTTGGGT GAAAGATGGA GATACCTGGT ACTATCTTGA 2160 

AGCATCAGGT GCTATGAAAG CAAGCCAATG GTTCAAAGTA TCAGATAAAT GGTACTATGT 2220 

CAATGGCTTA GGTGCCCTTG CAGTCAACAC AACTGTAGAT GGCTATAAAG TCAATGCCAA 2280 

TGGTGAATGG GTTTAAGCCG ATTAAATTAA ATCATGTTAA GAACATTTGA CATTTTAATT 2340 
TTGAAACAAA GATAAGGTTC GATTGAATAG ATTTATGTTC GTATTCTTTA GGTACCTATC 2400 

TTATGATTTC AGGAAATGTC ATTAAAAAAA CGACTCATTT TCTCTAACCT GAAAAATAGA 2460 

TTAGAGAAAA TGGGTTGTTT TATCTATTAT AGTTATTTGA ATGAAGmTAA GAAGAAGGTA 2520 

TACTCACATC ATTCACATAA TCTGTATATT GACTATAAGT TTTAAAAAAC AATTTTTAAG 2580 

CTCTTCCTTG TCTTCTCTAA CCAAGCGTGT TATAATGAAT ACTGCTCAAG CGACCTTCAA 2640 

TCGTGAAGCA CACACGACCT TCAATCGTGA ATAAACGAAT AGATGGGAGA CTTACCATGA 2700 

GTGATAACTC TAAAACACGT GTTGTCGTGG GGATGAGTGG TGGTGTTGAT TCGTCGGTGA 2760 

CGGCTCTTTT GCTCAAGGAG CAGGGCTACG ATGTGATCGG TATCTTCATG AAGAACTGGG 2820 

ATGACACAGA TGAAAACGGC GTCTGTACGG CGACCGAAGA TTACAAGGAT GTGGTTGCGG 2880 

TGGCAGACCA GATTGGCATT CCCTACTACT CTGTCAATTT TGAAAAAGAG TACTGGGACC 2940 

GCGTTTTTGA GTATTTCCTA GCGGAATACC GTGCAGGGCG CACGCCAAAT CCGGACGTTA 3000 

TGTGCAACAA GGAAATCAAG TTCAAGGCCT TTTTGGACTA TGCCATAACC TTGGGGGCAG 3060 

ACTATGTAGC GACTGGGCAT TATGCTCGAG TGGCGCGTGA TGAGGATGGT ACCGTTCACA 3120 

TGCTTCGTGG CGTGGACAAT GGCAAGGATC AGACCTATTT CCTCAGCCAA CTTTCGCAAG 3180 

AACAACTTCA AAAAACCATG TTCCCACTAG GACATTTGGA AAAGCCTGAA GTACGCAGAC 3240 

TAGCAGAAGA AGCAGGCCTT TCGACTGCTA AGAAGAAAGA CTCGACAGGG ATTTGCTTTA 3300 

TCGGAGAAAA GAACTTTAAA AACTTTCTCA GCAACTACCT GCCAGCTCAG CCTGGTCGCA 3360 

TGATGACTGT GGATGGTCGC GATATGGGCG AGCATGCAGG TCTTATGTAC TATACAATCG 3420 

GTCAGCGTGG CGGACTCGGT ATCGGTGGGC AACACGGCGG TGACAATGCC CCTTGGTTCG 3480 

TTGTCGGAAA AGATCTAAGC AAGAATATTC TCTATGTAGG ACAAGGATTC TACCATGATT 3540 

CGCTCATGTC AACTAGCCTA GAAGCCAGTC AAGTCCACTT TACTCGTGAA ATGCCAGAAG 3600 

AGTTTACGCT AGAATGTACG GCTAAATTCC GTTACCGTCA GCCTGACTCT AAGGTGACCG 3660 

TTCATGTCAA AGGAGAAAAG ACAGAGGTCA TCTTTGCGGA ACCACAACGC GCGATTACAC 3720 

CAGGACAGGC AGTTGTCTTT TACGATGGCG GG 3752 
(2) INFORMATION FOR SEQ ID NO: 209: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3580 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 209: 

TATTTATATT TTTTTATCTC TGGCATACTT TGATACCTTT TTAGACTTAA AGTCTTTAAT 60 

AGTGCCTTTC CACCTCTTTT TATCTATAAA GATTCTCCTA CATCATAATT CATTTTTTTA 120 

TTTAAACCTT TCTGTCTTAG TTTGTCTTTA TCTTCTTCAT ACCATTTTAA GATTGTCACA 180 

TAGTGGTTTT GATAGGTCTT ACCACTGCTT TCCATGTATC TGGATAGTTT ATTTATCATT 240 

ATATCTGTGT GTGAGTTTAA TTTTTCTTTT AGATTTTTAT ATTCTTCTTT GCTTAACCTT 300 

ACATTTTTGA ATTCTCCATA AAAAATGGGG GTGGACTTTT TATCTATCTC TCCCTCTCTC 360 

TCTTTATCTA TCTCTATATC TTTCCATGTA ATTCCAATCT GGAGTACCTC TACTGTCTAT 420 

CGGTAATTTA ATTTTGATAT CTGGCAATAC TGTGCTAGAT ATTTGATCTT TATATTCAGT 480 

ATTTTTTAAA GCTTGCCTAA TAATTGAAGT TAAATAGAAT GCTACTTCTT TATTCAATTC 540 

TTTATTTTTT AATTTTAAAC AATGAATTTT CATATCTAGG CTTGCTTTAT ATTTATGATA 600 

AAAGACTGCT CCTAAAAATG AAACAGATAT AAAATTTTCA AAAACTCTAT AATTTTTATC 660 

ATCTATATCT TCGTAGTAAC CTAAGATACC ATTGTCAATA TTTGTAGCAC TAATTCTAGG 720 

AGTTTTTCCA TCGAGTAAAT ATCTTTTTGG AATAGATGAG CCTGTTGGTA CTTAACTCGA 780 

TTTCCCCTTT TTTTCGGTAA TAAATATTTC TTTTTATTTT GTTGTCTGAT ATTTTTCCTA 840 

CCTGTCCTTT GTAGGATGAG TATTTTCTAG ATTTTCyTGA ATAACTTTTT ACTTGAAGTT 900 

TTAGCTTTTG AACTAGTCGT TGTACTTTCT TTTTGTTTAT TATCAGTCCT GATCTTTTTA 960 

ATATTGCTGT TATTCTCTAT ATCCTATTTT TCATTCATGA TATTCTTTTA CTAATTTTAT 1020 

CTTAAATTCT GTGCTGTATT TGCCATTAAA AAACTGACCT CCTTTAGTTA GTTTTTTGGC 1080 

CTAACTTTTG AGGGTCAGTT CAAAATTTGC GACTTTTAAA TGAATTCCAA TATTCAATTA 1140 

TTAAGAGTTA ACATGGTGCT TGCCAATAGG AATCATTAGA GGCGAATTGG AAATAGGGTC 1200 

ACGTATAATT TTTGCTTCAA GATTAAAGAT ATCTTTAACT AGTTTATCAT TTAGTATATC 1260 

TTCAGGCTTT CCCTCTGCAA CAAGTTTACC TTCTTTAATT GCAAATAGGT AATCAGCGTA 1320 

TCTTGCTGTT AGATTTATAT CGTGCAAAAT CATGCAAATG GTTGTCTTAT ATTTTTGGTT 1380 

TAGATCAGTC AAGAGGTCTA ATAGTTCTAT TTGATATGAG ATATCCAAGT AAGTAGTTGG 1440 

CTCATCTAAA AGTAGGATAC TTGTATCTTG GGCTAGGGCT AGAGCTATCC ATACTCTTTG 1500 

CCTTTGACCC CCAGAAAGTT CTTCAACTAG GTTATTTGCT AGATCTTCAA CATTGGCCTT 1560 

AACCATTGAT CTGTTTATTA TTTCAAGGTC ATCTTTTCCA AGACTCTTAA AAGGCTTTCT 1620 

GTAGGGGAAA CGACCACGGC TTACAAGATC AGCTACTGTT ATTGATTCAG GGATTATTGG 1680 

AGATTGAGGT AATATAGCTA TGTGTTTTGC TAAATCTTTT TCTTTATAAG AATTAATTGA 1740 

TTTATTATCA AGCAATACTT CTCCCTCTAA TGGCTTTATA AGTCGAGACA AGGTTTTAAT 1800 

GAGTGTTGAT TTCCCACAAC CATTTGACCC AATAATAACT GATATTTTTT CTTCAGGTAT 1860 

TTTTATATTT ATATTTTCCA AGATTATTTT TTCATCATAA CCGCAGGTAA GATTATTTGA 1920 

CCACAGACCT TTCATTATAT ATTCCTCCTG TTCATTTTTA TTAGTAAGTA TATTAAGTAT 1980 

GGTGAACCTA ACAAGCCAGT TACAACACCT ACTGGATATC TAGCTGGTAA AATATTTTGA 2040 

GAGAATATGT CTGATAACAA AACTAGTAAA ATTCCAACCA ATCCAGCTAA TATTGGGCTT 2100 

CTTTTCTTGC CAATATTTAA GGCTATGGGA CCAGCTAAAA AAGATATACA AGCTATTGGT 2160 

CCTGTAATTG AAGTAGAAAA AGCAGTTAAA GATACAGCGC AAAAAATTAA AACAAGCCTT 2220 

GAAAGCTCGG GATTTGCTCC AAGTCCGATT GCTATTTCTT CACCAAGTTC AATAATTTCT 2280 

AGTCTTTTAT TAAAAAATAA AACTAATATA GTAGCAATAA TACTTACTAT TAGAACAAGA 2340 

GGTATGTCAT CTAACTTTGT AAAAGATAAA GAGCCACTGA GCCATCTCAT AACTTCTTGT 2400 

AATTCATATC TTGCTACTTT CAACAATAAA AATGAGGTGC CTGCTCTTGT GACAGCTTGA 2460 

AAACCAATAC CTAATATTAT CAGTCTTGCT GCTGAAAAAC CATCTTTTTT AGCTAGTAAA 2520 

AATAATATTA AAGATGATGT TAGTCCACAA GTTATTGAAA TAATTCCAGT AGTTAAACTA 2580 

TTTGTTTTTA ATACCAATAT GCAAAAGACC GCTGCAATAG ATGAAGAACT TGTGACACCG 2640 

ATTATATCAG GACTTGCAAG AGGATTTCTT AACATAGTTT GAAAGATAAA TCCTGCCAAT 2700 

CCAAAAGACC AGCCAGCTAT AATTCCTGCT AATAATTTTG GTAATCTAAT TTCCATAATC 2760 

GAAAAACTAG CTCCAGGAAC AGTTTCACTA TTTAAGACTT TAATCAAAGT TGAAAAAGAA 2820 

TAACTTTCAT CTCCGATAAG TAAAATGAAA AATGATAGAC TGATTATTAT TAATAAAAAT 2880 

AGTGAGGAAA ATAGTGTTAT TCTATTTTTT CTTTTTTGAA TACCTATAAT TAAATTTTGC 2940 

ATTAGTTATT AACCCCTCTA TTTTTCATAG TTACATAAAT AAGTACTGGA CCCCCGATTA 3000 

TTGCAGTAAT TATCCCTACT TCAATTTCAC CTGGTTTACC TAACATACGG CCGATTATAT 3060 

CACATATAAG CAAGAGCTCT GCACCTATAA AAGATGAAGA AATGGTCATT GTGCGTATAT 3120 

CTTTGCTTAT AAATAAGCCA CAAAAGTGAG GAACTATAAG ACCTACGAAG CCAATAGGTC 3180 

CACCAATTGC AGTAATACTT GAACATAAAA GCACACTTGC AATTATTGCA AGTGATCTTA 3240 

TCCTATTAAC ATTAACTCCA AGACCAACAG CCATTTCATC ACCCATAGcT AAAGCGTTTA 3300 

AATCTGATGA AATAAATATA GCTATCAAGT GACCTAAAAT TATAAAAGGT AGTAGTGTAG 3360 

ATATAGAAGA TAATGTAGCT GCTCCAAGGC TACCTATTTG CCAAAATCTA AATTTGTCTA 3420 

AGACGTTATT ATTCGGTAAA ATTAAAAAAC TTACAAAACT GCTTAAAGCC ATACTAACAC 3480 
AAGTTCCTGA TAAGGCAAGT TTTATAGGGG TAAGGCCTGC TTTTCCGTTA CAGCAATCGC 3540 
GTATACAAAA ATTGCACTTA CTAAGCCACC AATGATTGCG 3580 
(2) INFORMATION FOR SEQ ID NO: 210: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11378 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 210: 

CCAAATTGCT CCACAATTAT TATGGAGTCG TCGTTTGGCA GATGGGCGTG ATATGTGTGC 60 

TCAAGAATGG TTGACAGGCA AGATATTGAC CCCCTATGAT ATGAATCGTA AGCAAATCGT 120 

CAATATTTTA ACCCGTCTTC ATCGCTCACG TCCGTTGATG ACACAATTGA GTCGTTTGGG 180 

CTATGCCATG GAAACACCTG TAGATTTACT ACAGTCTTGG CAGGAAACGG CTCCAGATGC 240 

TTTGCGTAAA AATCATTTTA TCAGTGAAGT GATGGCTGAT TTACGTCAGA CTATTCCAGG 300 

ATTTAGAGAG GACCATGCGA CCATTGTCCA TGGAGATGTA CGACATAGTA ATTGGATTGA 360 

GACAGATAGT GGCTTGATTT ATTTAGTAGA TTGGGATTCG GTTCGCTTGA CCGATCGCAT 420 

GTTTGATGTG GCCCATATGC TCTGCCATTA TATTTCAGAA CATCAGTGGA AGGAATGGTT 480 

GACCTACTAC GGTTACAAGT ACAATCAAAC GGTATTAAGT AAATTGTATT GGTATGGTCA 540 

ATTGTCTTAT TTGAGTCAGA TTTCCAAGTA TTATATGAAC CAAGATTTAG AAAATGTCAA 600 

TCGGGAGATT CATGGTTTGC GTCATTTCCG AGACAAGTAT GGAAAGAGAA GATGAGAGTT 660 

AGAAATCGTA AAGGGGCAAC AGAATTACTA GAGGCAAATC CCCAGTATGT GGTCCTCAAT 720 

CCCTTGGAAG CCAAGGCAAA ATGGCGGGAC TTGTTTGGCA ATGATAATCC CATTCATGTG 780 

GAAGTTGGAA GTGGAAAGGG TGCCTTTGTT TCAGGTATGG CCAAGCAAAA CCCTGACATC 840 

AACTATATCG GGATTGATAT TCAAAAGTCT GTTTTGAGCT ACGCTTTGGA CAAGGTGCTT 900 

GAAGTTGGAG TGCCTAACAT CAAGCTCTTG TGGGTAGATG GTTCTGACTT AACTGACTAC 960 

TTTGAAGACG GTGAGATTGA TCGCTTGTAT CTGAACTTTT CAGATCCATG GCCGAAAAAA 1020 

CGCCATGAAA AGCGTCGTTT GACCTACAAG ACCTTCTTGG ATACCTTCAA ACGTATCTTG 1080 

CCTGAAAATG GAGAAATTCA TTTCAAGACG GATAACCGTG GCTTGTTTGA GTACAGTTTA 1140 

GTGAGCTTTT CTCAATATGG CATGAAACTC AATGGTGTCT GGTTAGATTT GCATGCCAGT 1200 

GATTTTGAAG GCAATGTCAT GACAGAATAC GAGCAAAAAT TCTCAAACAA GGGGCAAGTT 1260 

ATCTACCGAG TTGAGGCAGA ATTTTAAGAG ATAACCTAAA ATTAGGCTGT ACAAGTGCTT 1320 
TTGCTTTACA TAAGTTGGCA AACGTGCTAT ACTGATAGTA AGAATATGAA AAGTGAGGCG 1380

GGGAAATATC TTCGCCTCTT GCTTATGAGG AGGTGGACGC AATCGCAACA ATCGTAGAAT 1440

TAGTCAGAGA AGTTGTAGAA CCTGTCATAG AAGCTCCTTT TGAACTCGTG GATATCGAGT 1500

ATGGAAAGAT TGGCAGTGAC ATGATTCTCA GTATTTTTGT AGATAAACCC GAAGAATTAC 1560

CTTGAACGAC ACGGCAGACT TGACAGAAAT TATCAGTCCT GTCCTAGACA CCATCAAGCC 1620

AGATCCCTTC CCAGAACAAT ATTTCCTAGA AATTACCAGT CCAGGTTTGG AACGTCCTTT 1680

GAAAACCAAG GATGCCGTCG CTGGAGCGGT TGGAAAATAC ATCCATGTCG GGCTCTACCA 1740

AGCCATCGAT AAGCAAAAGG TCTTTGAAGG AACCTTGTTG GCCTTCGAAG AGGACGAGTT 1800

GACTATGGAA TATATGGACA AGACGCGTAA GAAAACCGTC CAAATTCCAT ACAGTTTAGT 1860

ATCAAAAGCA CGTTTAGCAG TTAAATTATA GAAAAAGAAA GGATAGCTTT TGAGGATTCA 1920

AAAGTGAAGA AAACATGAGT AAAGAAATGC TAGAGGCCTT CCGCATTTTG GAAGAAGACA 1980

AGGGAATCAA AAAAGAAGAT ATCATCGACG CAGTAGTAGA GTCGCTTCGT TCCGCTTATC 2040

GCAGACGCTA TGGTCAGTCA GACAGCGTAG CTATTGACTT CAACGAAAAA ACAGGTGACT 2100

TTACAGTTTA TACTGTCCGT GAAGTTGTTG ATGAAGTATT TGATAGCCGT TTGGAAATCA 2160

GCTTGAAAGA TGCTCTTGCC ATTAATTCAG CTTATGAACT TGGAGACAAA ATCAAGTTTG 2220

AAGAAGCACC AGCTGAGTTT GGTCGTGTAG CAGCCCAATC TGCCAAACAA ACCATCATGG 2280

AAAAAATGCG CAAgCAAACA CGTGCCATCA CTTACAATAC TTACAAAGAA CATGAGCAAG 2340

AAATCATGTC TGGTACAGTA GAACGCTTTG ACAACCGCTT TATCTATGTC AACCTTGGTA 2400

GCATCGAAGC CCAATTGTCA AAACAAGACC AAATTCCTGG AGAAGTTTTT GCTTCTCATG 2460

ATCGTATCGA AGTTTATGTT TACAAGGTTG AAGACAACCC TCGTGGTGTG AACGTCTTTG 2520

TTAGCCGTAG TCATCCAGAA ATGATCAAAC GTTTAATGGA GCAAGAAATT CCAGAAGTTT 2580

ATGATGGAAC TGTTGAAATC ATGAGCGTGG CTCGTGAAGC AGGTGACCGT ACGAAGGTTG 2640

CTGTTCGTAG CCACAATCCA AACGTGGATG CTATCGGTAC AATCGTTGGA CGTGGTGGTG 2700

CTAATATCAA GAAGATTACT AGCAAATTCC ACCCAGCTCG TTACGATGCT AAAAATGACC 2760

GCATGGTACC AATCGAAGAA AATATCGATG TTATCGAGTG GGTAGCAGAT CCAGCTGAAT 2820

TTATCTACAA TGCCATCGCT CCTGCTGAGG TTGACCAAGT TATCTTTGAT GAAAACGACA 2880

GCAAACGTGC CTTGGTGGTT GTTCCAGATA ACAAGCTTTC TCTTGCCATT GGTCGTCGTG 2940

GACAAAACGT GCGCTTGGCG GCTCACTTGA CTGGTTACCG TATCGATATC AAGTCTGCTA 3000

GCGAATTTGA AGCCATGGAA GACGCTGCTT CAGTAGAGTT GGAAGTAGAA AACGATACTG 3060
TAGAAGAATA AAAGCTGCTA GAGGAGGGAA AGATGAAAAC AAGAAAAATC CCTTTGCGCA 3120

AGTCTGTTGT GTCTAACGAA GTGATTGATA AGCGTGATTT GCTCCGCATT GTCAAGAACA 3180 

AGGAAGGACA AGTCTTTATT GATcCTACGG GCAAGGCCAA TGGCCGCGGC GCTTATATCA 3240 

AACTAGACAA TGCAGAAGCC CTAGAGGCGA AAAAGAAGAA GGTCTTTAAC CGCAGCTTTA 3300 

GCATGGAAGT GGAAGAAAGC TTTTATGACG AGTTGATCGC TTATGTGGAT CACAAAGTGA 3360 

AAAGAAGAGA GTTGGGACTT GAATAAGCAA AAGATAAGTA ATCTCTTGGG GCTTGCTCAG 3420 

CGAGCAGGGC GCATCATATC GGGTGAAGAA TTGGTGGTCA AGGCCATTCA AGACGGCAAG 3480

GCCAAGTTGG TCTTTCTAGC TCATGATGCT GGACCCAATC TGACCAAGAA GATTCAAGAT 3540 

AAAAGTCATT ATTATCAAGT AGAAATTGTA ACCGTGTTTT CAACACTGGA ATTAAGCATA 3600 

GCAGTCGGGA AATCGAGAAA GGTTTTGGCT GTAACAGATG CTGGATTTAC AAAGAAAATG 3660 

AGGTCTCTTA TGGAATAGAA GAGGAGGACA TGATTTGTCT AAGAAAAGAT TGTACGAAAT 3720 

CGCAAAAGAA CTTGGAAAAG AAAGTAAAGA AGTTGTAGCG CGTGCAAAAG AGTTGGGCTT 3780 

GGATGTGAAA AGCCACTCAT CAAGTGTGGA AGAAGCTGTC GCTGCAAAAA TTGCTGCCAG 3840 

CTTTAAGCCT GCAGCTGCTC CGAAAGTAGA AGCAAAACCT GCAGCCCCAA AAGTAAGTGC 3900

AGAAAAGAAA GCCGAAAAAT CTGAGCCAGC TAAACCAGCT GTAGCTAAGG AAGAGGCAAA 3960 

ACCTGCAGCC CCAAAAGCAA GTGCAGAAAA GAAAGCCGAA AAGTCTGAAC CAGTAAAACC 4020 

AGCTGTAGCC AAGGAAGAGG CAAAACCAGC TGAGCCAGTC ACTCCGAAAA CAGAAAAAGT 4080 

AGCGGCTAAA CCGCAAAGTC GTAATTTCAA GGCTGAGCGT GAAGCACGTG CTAAAGAGCA 4140 

GGCAGAGCGA CGCAAGCAAA ATAAGGGCAA TAACCGTGAC CAACAACAAA ACGGAAACCG 4200 

TCAGAAAAAC GACGGCCGTA ATGGTGGAAA ACAAGGTCAA AGCAACCGCG ACAATCGTCG 4260 

CTTTAATGAC CAAGCTAAGA AGCAGCAAGG TCAGCAAAAA CGTAGAAATG AGCGCCGTCA 4320 

GCAAGAGGAT AAACGTTCAA ATCAAGCGGC TCCACGTATT GACTTTAAAG CCCGTGCAGC 4380 

AGCCCTAAAA GCAGAGCAAA ATGCAGAGTA CGCTCGTTCA AGTGAGGAAC GCTTCAAGCA 4440 

GTATCAGGCT GCTAAAGAAG CCTTGGCTCA AGCTAACAAA CGCAAGGAAC CAGAGGAAAT 4500 

CTTTGAAGAA GCGGCTAAGT TAGCTGAACA AGCACAGCAA GTTCAAGCAG TGGTTGAAGT 4560 

CGTCCCTGAG AAAAAAGAAC CTGCAGTGGA TACACGTCGT AAAAAACAAG CTCGACCAGA 4620 

CAAAAATCGT GACGATTATG ATCATGAAGA AGATGGTCCT AGAAAACAAC AAAAGAATCG 4680 

AAGTAGTCAA AATCAAGTGA GAAATCAAAA GAATAGTAAC TGGAATAACA ACAAAAAGAA 4740 

CAAAAAAGGC AATAACAAGA ACAACCGTAA TCAGACTCCA AAACCTGTTA CGGAGCGTAA 4800 

ATTCCATGAA TTGCCAACAG AATTTGAATA TACAGATGGT ATGACCGTTG CGGAAATCGC 4860 
AAAACGTATC AAACGTGAAC CAGCTGAAAT TGTTAAGAAA CTTTTCATGA TGGGTGTCAT 4920

GGCCACACAA AACCAATCCT TGGATGGGGA AACAATTGAA CTCCTCATGG TGGATTACGG 4980 

TATCGAAGCC AAACAAAAGG TTGAAGTGGA TAATGCTGAC ATCGAACGTT TCTTTGTCGA 5040 

AGATGGTTAT CTCAATGAAG ATGAATTGGT TGAGCGTCCA CCAGTTGTTA CTATCATGGG 5100 

ACACGTTGAC CACGGTAAAA CAACCCTTTT GGATACTCTT CGTAACTCAC GTGTTGCGAC 5160 

AGGTGAAGCA GGTGGTATTA CTCAGCATAT CGGTGCCTAC CAAATCGTGG AAAATGGTAA 5220 

GAAGATTACC TTCCTTGATA CACCAGGACA CGCGGCCTTT ACATCAATGC GTGCGCGTGG 5280 

TGCTTCTGTT ACCGATATTA CGATCTTGGT CGTAGCGGCA GATGACGGGG TTATGCCTCA 5340 

GACTATTGAA GCCATCAACC ACTCAAAAGC AGCTAACGTT CCAATCATCG TAGCTATTAA 5400 

CAAGATTGAT AAACCAGGTG CTAACCCAGA ACGCGTTATC GGTGAATTGG CAGAGCATGG 5460 

TGTGATGTCA ACTGCTTGGG GTGGAGATTC TGAATTTGTT GAAATTTCGG CTAAATTCAA 5520 

CCAAAATATC GAAGAATTGT TGGAAACAGT CCTTCTTGTG GCTGAAATCC AAGAACTCAA 5580 

AGCAGACCCA ACAGTTCGTG CGATCGGTAC GGTTATCGAA GCGCGCTTGG ATAAAGGAAA 5640 

AGGTGCGGTC GCAACCCTTC TTGTACAACA AGGTACCTTG AATGTTCAAG ACCCAATCGT 5700 

TGTCGGAAAT ACcTTCGGTC GTGTCCGTGC TATGACCAAC GACCTTGGTC GTCGTGTTAA 5760

AGTTGCTGGA CCATCAACAC CAGTCTCTAT CACAGGTTTG AACGAAGCAC CGATGGCGGG 5820 

TGACCACTTT GCCGTTTACG AGGATGAAAA ATCTGCGCGT GCAGCAGGTG AAGAGCGTGC 5880 

CAAACGTGCC CTCATGAAAC AACGTCAAGC TACCCAACGT GTTAGCCTTG AAAACCTCTT 5940 

TGATACCCTT AAAGCTGGGG AACTCAAATC TGTTAATGTT ATCATCAAGG CTGATGTACA 6000 

AGGTTCTGTT GAAGCCCTTT CTGCCTCACT TCAAAAGATT GACGTGGAAG GTGTCAAAGT 6060 

GACTATCGTC CACTCAGCGG TCGGTGCTAT CAACGAATCA GACGTGACCC TTGCCGAAGC 6120 

TTCAAATGCC TTTATCGTTG GTTTCAACGT ACGCCCTACA CCACAAGCTC GTCAACAAGC 6180 

AGAAGCTGAC GATGTGGAAA TCCGTCTTCA CAGCATTATC TACAAGGTTA TCGAAGAGAT 6240 

GGAAGAAGCT ATGAAAGGGA TGCTTGATCC AGAATTTGAA GAAAAAGTTA TTGGTGAAGC 6300 

GGTTATCCGT GAAACCTTCA AGGTGTCTAA AGTGGGAACT ATCGGTGGAT TTATGGTTAT 6360 

CAACGGTAAG GTTGCCCGTG ACTCTAAAGT CCGTGTTATC CGTGATGGTG TCGTTATCTA 6420 

TGATGGTGAA CTCGCAAGCT TGAAACACTA TAAAGACGAC GTGAAAGAAG TGACAAACGG 6480 

TCGTGAAGGT GGATTGATGA TCGACGGCTA CAATGATATT AAGATGGATG ATGTGATTGA 6540 

GGCGTATGTC ATGGAAGAAA TCAAGAGATA AGATTTTTTG CTCCTTTCTT AGGTGGTGAG 6600 
GGACGCAAGC AAACCGATGG TTTCATTGCT TATTTTTGAG CCTAGGGTCT CAAAAATCCC 6660

CTGTGATGGG ACTGATAAAT CAGTTCCATC ACTTTCACCA CGGCGAAAGA AGCAGATGAC 6720 

TTCAAATTGA ACTTCGTTTC AATTTAAACT GAAAATCAAG AAGTTTAAAA TAGCTAGGTC 6780 

TGCTGGCCTA GCTTTTGGTT CAAAGTAGAG AAAGGAATAT CATGGCAAAT CATTTCCGTA 6840 

CAGATCGTGT GGGCATGGAA ATCAAGCGTG AAGTCAATGA GATTTTGCAA AAGAAAGTCC 6900 

GTGATCCACG TGTCCAAGGT GTGACCATCA TAGATGTTCA GATGCTGGGT GACTTGTCTG 6960 

TTGCCAAGGT TTATTACACC ATTTTGAGTA ACCTTGCTTC GGATAACCAA AAAGCCCAAA 7020 

TCGGGCTTGA AAAAGCAACT GGTACCATCA AACGTGAACT TGGTCGCAAT TTGAAATTGT 7080 

ACAAAATCCC AGATTTGACC TTCGTCAAAG ACGAGTCCAT CGAGTATGGA AACAAGATTG 7140 

ACGAGATGCT ACGCAATCTG GATAAGAACT AAAGAAGAGG GGTTGCCCCT CTTTTTTGGT 7200 

GGAGGAAAAT AGGTTGAATT TGAAATGGAA AAATATTCTT TTATAATAGA TTGAAACTAG 7260 

AATAGTACGC CTCTACTTCT AAAATATTGT TAGAAATCGA TTTGACTGTC CTGATCGATT 7320 

TGTCCTGTTC TTGTTTCATT TTAATATAAA AAAGGGATTC TGTATTTTTT AATGTTATCT 7380 

AATTAGAAAA TGCTTTTTTT GTAGGAAATA TAATATGATA AGGTGCAAAA AAGAAATAAG 7440 

GAGTTTGTAT ATGGCTGAAC AAGACTTAGC TATGCAAGTA TTGCAACAAG TGGTGAAACT 7500 

ACCTGTTGTT AAGGTTGATC GTTCGAAATT TTTAGTGGAT AAGTTTTCCA AAGAATTGGA 7560 

TCCAAAAGAT ATTCCTACCT TATTGGAACA AGGTCCAACG ACTCTTCTAT CTCAAGAAAT 7620 

ATTAGATCGT GTAGCTAATG CTTGTATTCG GGACAATGTA TTATTAGCGA GTGGGACTTC 7680

TGTTTTGGCA GGATTACCTG GAGGGCTTGC TATGGCAATT ACCATTCCAG CTGATGTGGC 7740 

TCAATTTTAT GCTTTCTCTC TGAAATTGGC TCAAGAATTA GGTTATATTT ATGGTTATGA 7800 

GGATCTTTGG GCTTCACGAG AGGAGTTGAG TGAAGATGCT CAAAATACCC TCTTGCTTTA 7860 

TCTAGGCGTA ATGTTAGGGG TGAATGGAAC CGCTGCTTTG CTACGTGTTG GTAGTATAAC 7920 

AATTGCCAAA CAGGTAATGA AAATAGTGCC TAATAAAGCT TTAACAAAGA CGCTTTGGTA 7980 

CCCTATTTTG AAAAAAGTCT TAAAAATATT TGGTGTGAAT CTTACCAAGG GAGGGTTGGC 8040 

CAAAGGAATG GGGAAATTTA TTCCTATCTT GGGTGGTATC ATTTCAGGTG GTTTAACCTT 8100 

TGCAACTATG AAACCAATGG GGGAAAGCTT GCAGAAAGAA TTATCCAAGC TAGTCAACTA 8160 

TAGTGAAGTT CAATATCAAG AAGATGTTGA AACAATCCGA AAAGAGGCTG AAATCATCAA 8220 

AGGAGAGTAA TATGAATCCT ATCAAAGCTT TTGCTAAAAT TTATGGTAAT TACTTTTTGA 8280 

CCGTGCAAGG TGTAAAAGTG ATGAAAACGA TAAAGAAAGC TGACCATGTC GTTGTTGGTC 8340 

TGGGGAAACT TTTTATTGCC GACAAGTTAA TGGATACGGC TCGGTGGCTC ATTAAGCCAG 8400 
AGGAGAGAGA ATGAAATTTT TTTGGTCTTC TTGCTATTCT TTTTATCAAA CCGATTATTG 8460

GGATTGTGAA ATTCTTTTGG ATGATCATCT CTTTTGCAGT CCAATTGCTG TTTTACAAGA 8520

TAGTGTTTAA GATATTGGAT TGGCTCTTTA AACTTATCTA GATGGTAATC CAAGTTGCAG 8580

AGAACTAGCA GGAACTCCAC TGCTAGTTTT TTATTCTCTT TCCATATGGT ATAATATAAG 8640

CAGTAAAATC ATTTTATACT CTTCGAAAAT CTCTTCAAAC CACGTCAGCT TCACCTTGCA 8700

GTATATATGT TACTGACTTC GTCAGTTCTA TCCACAACCT CAAAACGGTG TTTTGAGCTG 8760

ACTTCGTCAG TTCTATCTAC AACCTCAAAA CACTGTTTTG AGCAACCTGC GGCTAGCTTC 8820

CTAGTTTGCT CTTTGATTTT CATTGAGTAT TAGAACATAC AATGGAGGTC GTCATGGACA 8880

ATATCATCGA TGTGTCAATT CCTGTTGCAG AAGTGGTGGA CAAGCATCCA GAAGTCTTGG 8940

AAATTCTAGT GGAGTTGGGT TTTAAACCCC TTGCCAATCC CTTAATGCGC AATACAGTTG 9000

GTCGTAAAGT ATCACTTAAA CAGGGTTCTA AGCTAGCAGG AACTCCTATG GACAAGATTG 9060

TACGCACACT GGAAGCGAAT GGCTACGAAG TGATTGGATT AGACTAATGA CAGATGAACG 9120

GATTCATATC CTACGGGATA TTTTGTTAGA ATTGCACAAT GGCGCCTCTC CTGAGTCGGT 9180

TCAAGATCGC TTTGATGCGA CCTTTACGGG CGTGTCAGCC ATCGAGATTT CCCTTATGGA 9240

GCACGAGCTG ATGAACTCGG ATTCGGGCGT CACTTTTGAA GATGTTATGG AACTCTGTGA 9300

TGTCCATGCC AATCTTTTTA AAAATGCTAT CAAAGGTGTC GAAGTTTCAG ATACTGAGCA 9360

TCCAGGTCAC CCAGTTCGTG TCTTCAAAGA AGAAAATCTG GCTCTCCGTG CGGCCTTGAT 9420

TCGCATTCGT AGATTGTTAG ATACCTATGA GTCTATGGAA GACGAGGAAA TGCTGGCGGA 9480

GATGCGTAAG GGTTTGGTGC GTCAGATGGG ACTTGTGGGT CAATTTGACA TCCATTACCA 9540

ACGTAAGGAA GAACTCTTCT TTCCTATCAT GGAGCGCTAT GGACACGATT CACCTCCCAA 9600

AGTTATGTGG GGAGTGGATG ATCAGATTAG GGAACTCTTT CAAACAGCTC TAACGACAGC 9660

CAAGTCACTA CCAGAAGTGT CAATTAGCAG TGTAAAGGAA GCTTTTGAAG CTTTTGCGAC 9720

AGAGTTTGAA AGTATGATTT TCAAGGAAGA GTCCATCCTC CTCATGATTC TCCTTGAGTC 9780

TTTTACTCAG GATGACTGGC TTCAGATTGC GGAGGAGAGC GATGCCTATG GCTATGCCAT 9840

CATCCGTCCG TCAGAGAAAT GGGTGCCAGA ACGACAGAGC TTTATTGAGG AAAAGATTGC 9900

AGAGGAGCCT GTACAGCTAG ATACGGCAGA AGGTCAAGTT CAACAAGTCA TAGATACGCC 9960

AGAAGGCCAT TTTACCATTA CCTTTACCCC TAAGGAAAAG GAAGCTGTGC TGGACCGCCA 10020

TAGTCAACAG GCTTTTGGTA ATGGCTATCT TTCAGTCGAG CAGGCCAATC TCATCCTCAA 10080

TCATCTCCCT ATGGAGATTA CCTTTGTCAA TAAAGAAGAT ATTTTCCAGT ATTACAATGA 10140



CAATACGCCA GCTGATGAGA TGATTTTCAA ACGGACGCCG TCCCAAGTCG GGCGCAATGT 10200

CGAACTCTGC CATCCGCCTA AGTACTTGGA CAAGGTCAAA ACTATCATGA AGGGGCTTCG 10260 

TGAGGGAAGC AAAGACAAGT ATGAAATGTG GTTCAAGTCT GAGTCGCGAG GTAAGTTTGT 10320 

CCACATCACC TATGCTGCAG TACACGATGA AGACGGAGAA TTCCAAGGAG TGTTGGAGTA 10380 

TGTTCAGGAT ATCCAGCCCT ACCGTGAGAT TGATACGGAC TATTTTCGTG GATTAGAATA 10440 

AGGAGAAAAA ATGAGTTACG AACAAGAATT TATGAAGGAA TTTGAAGCTT GGGTCAATAC 10500 

CCAAATCATG ATTAACGACA TGGCGCACAA GGAAAGCCAA AAAGTTTACG AAGAAGACCA 10560 

GGACGAGCGT GCCAAAGATG CCATGATTCG CTACGAGAGT CGCTTGGATG CTTATCAGTT 10620 

CTTGCTTGGT AAGTTTGAAA ACTTCAAAGT AGGCAAGGGA TTCCATGATT TGCCAGAAGG 10680 

CTTGTTTGGT GAGCGAAATT ATTAAACGAG AAAGATTCTT GATTTTTCAC TAAAATCTTG 10740 

ATAGAATGTT TATGTTAAAT CCTTGTCAGA GCAGGGATTT TTTATTGAAA GGATTTTATC 10800 

ATGTCAAAGA AACTCAATCG TAAAAAACAA TTACGAAATG GCCTCCGTCG CGCAGGTGCC 10860 

TTTTCAAGTA CGGTGACTAA GGTTGTAGAT GAGACAAAAA AAGTCGTGAA GCGTGCAGAA 10920 

CAGTCAGCAA GCGCAGCTGG TAAGGCTGTT TCTAAAAAAG TTGAACAAGC AGTAGAAGCT 10980 

ACCAAAGAGC AAGCTCAAAA AGTAGCTAAT TCTGTAGAAG ATTTTGCAGC AAATTTGGGT 11040 

GGACTTCCAC TTGATCGTGC CAAGACTTTC TATGATGAAG GAATCAAGTC TGCTTCAGAT 11100 

TTCAAAAACT GGACTGAAAA AGAACTCCTT GCCTTGAAAG GAATCGGCCC AGCTACCATC 11160 

AAGAAATTGA AAGAAAATGG CATCAAGTTC AAGTAATTTT TCTTGAGCCT TGCATTTCCG 11220 

AAAAAATCTT GCTACAATAG AGCCATTAGA GGTGTTTTGA ATCCCACATT TTACAGAAAG 11280 

TGGCGGCGCT GAGAAGTCCA CAAATGTGTC AAAACTGGTT GCTAATGGAT GAAAAATTGA 11340 

AATAAAAGTG TCTTTTTGCT TTAAAGACGA GAGTTGCG 11378 
(2) INFORMATION FOR SEQ ID NO: 211: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4156 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 211:

CCGCGAGCCA CGGCGAATTT GCTGCGGGTA TTCATCAGTC AGGATCTATG ATCTTTGGTG 60 

AACAAGAAAA GGTTCAAGTT GTGACCTTTA TGCCAAATGA AGGTCCTGAT GATCTATACG 120 

CTAAGTTTAA TAACGCTGTT GCTGCATTTG ACGCAGAAGA TGAGGTTCTA GTTTTGGCTG 180 
ACCTTTGGAG TGGTTCTCCA TTTAACCAAG CTAGTCGCGT GATGGGAGAA AATCCTGAGC 240

GTAAGTTTGC CATCATCACA GGACTTAACT TACCGATGTT GATTCAAGCC TACACAGAGC 300

GCCTCATGGA CGCTGCTGCA GGTGTAGAAA AAGTCGCTGC TAATATCATT AAAGAAGCCA 360

AAGATGGCAT CAAAGCTCTT CCAGAAGAGC TAAATCCAGT CGAAGAAGTT GCAAGCGCTG 420

CAGCTGCTCC AGTTGCCCAA ACTGCTATCC CAGAAGGAAC TGTTATCGGA GACGGTAAAT 480

TGAAAATCAA TCTTGCCCGT CTTGACACAC GTCTACTTCA CGGTCAGGTT GCAACTGCTT 540

GGACTCCAGA TTCAAAAGCA AATCGTATCA TCGTTGCTTC AGATAACGTG GCTAAAGACG 600

ACCTTCGTAA AGAATTGATT AAACAAGCAG CTCCAGGTAA TGTCAAGGCT AACGTGGTTC 660

CAATTCAAAA ACTGATTGAG ATTTCAAAAG ACCCACGTTT TGGAGAAACA CATGCCCTTA 720

TCTTGTTTGA AACACCTCAA GATGCCCTTC GTGCCATCGA AGGCGGCGTG CCAATCAAGA 780

CTCTTAATGT TGGTTCTATG GCTCACTCAA CAGGTAAAAC ATTGGTCAAT ACCGTTTTGT 840

CTATGGACAA AGAAGACGTT GCTACATTTG AAAAAATGCG TGACTTGGGT GTTGAATTTG 900

ATGTCCGTAA AGTACCAAAT GATTCTAAAA AAGATTTGTT TGACTTGATT AACAAAGCCA 960

ATGTCAAATA AGCCATTATT TATGAAAGGA TTTTAAACAT GTCTATTATT TCTATGGTTT 1020

TAGTAGTCGT TGTAGCCTTC TTTGCAGGTC TTGAAGGCAT CCTCGACCAG TTCCAATTTC 1080

ACCAACCACT TGTAGCCTGT ACCCTTATTG GGCTTGTAAC AGGTCACTTG GAAGCAGGGA 1140

TTATCCTCGG TGGATCGCTT CAAATGATTG CCCTTGGTTG GTCAAATATC GGTGCTGCTA 1200

TCGCTCCTGA TGCTGCACTT GCTTCTGTCG CTGCTGCCAT TATCATGGTT CTTGGTGGTG 1260

ACTTTACCAA GACTGGTATC GGTGTTGCCC AAGCGGTTGC TATCCCTCTT GCTGTAGCTG 1320

GACTTTTCTT GACAATGATT GTTCGTACAA TTTCAGTTGG TTTGGTTCAT ACTGCAGATG 1380

CTGCCGCTAA AAAAGGTGAC TTCGGCGCTG TGGAGCGTGC GCATTTCATC GCGCTACTTT 1440

TCCAAGGACT TCGTATCGCG CTTCCTGCAG CTCTTCTCCT TATGGTACCA ACTGAAACTG 1500

TACAAAGTAT CCTTAGTGCC ATGCCAGACT GGCTCAAAGA TGGTATGGCT ATCGGTGGTG 1560

GTATGGTCGT TGCCGTTGGT TACGCCATGG TTATCAACAT GATGGCAACT CGTGAAGTAT 1620

GGCCATTCTT CGCTCTTGGT TTCGTTCTCG CTGCTGTGTC AGATATTACT CTAATCGGAT 1680

TCGGTGCTAT CGGCGTTGCT ATCGCTCTTA TCTACCTTCA CCTTTCTAAA ACTGGTGGAA 1740

ATGGTGGCGG AGGAGCCGCA ACTTCTAACG ACCCAATCGG CGATATCCTA GAAGACTACT 1800

AAGATAAGAA AGGACTGAAA ACATCATGAC TGAAAAACTT CAATTAACTA AATCAGATCG 1860

TAAAAAAGTT TGGTGGCGTT CAACCTTCTT ACAAGGGTCT TGGAACTTTG AACGGATGCA 1920






WO 98/18931 



PCT/US97/19588 




AAACTTGGGC TGGGCTTATA CACTCATTCC AGCTATCAAA AAACTCTATA CTAAAAAAGA 1980 

AGATCAAATC GCTGCTCTTG AGCGTCACCT TGAGTTCTTC AACACTCATC CATACGTAGC 2040 

TGCTCCAGTC ATGGGGGTTA CTCTTGCGCT TGAAGAAGAA CGTGCTAACG GTGTGGAAAT 2100 

CGATGACGCT GCTATCCAAG GGGTTAAAAT CGGTATGATG GGACCTCTTG CTGGTATCGG 2160 

TGACCCAGTA TTCTGGTTTA CAGTACGCCC AATCCTTGGA TCTCTCGGTG CTTCACTTGC 2220 

CCTTACTGGC AATATCTTGG GGCCACTCCT CTTCTTTGTT GCATGGAACT TGATTCGTAT 2280 

GTCATTCTTG TGGTATGTTC AAGAGATTGG ATACAAGGCT GGATCAGAAA TCACTAAAGA 2340 

TATGTCTGGT GGTATCCTTC AAGATATCAC TAAAGGAGCT TCTATCCTTG GGATGTTCAT 2400 

TCTTGCTGTC CTTGTTCAAC GCTGGGTAAA TATTAAATTT GCTTTCGATG TTTCTAAAGT 2460 

TCAACTAGAT GAAAAGGCTT ATATCCATTG GGATAAATTG CCAGAAGGGT CTAAAGGTAT 2520 

CCAAGAAGCA TTCGCACAAG TAGGACAAGG ATTGTCTCAA ACTCCTGAAA AAGTTACTAC 2580 

TTTCCAACAA AACTTGGATA TGTTGATTCC TGGATTATCA GGACTACTCC TTACTTTACT 2640 

TTGCATGTAC TTACTTAAGA AAAAAGTATC TCCAATCACT ATTATCCTTG CCCTCTTCGC 2700 

AGTGGGTATT GTGGCACATG TTCTTCACAT CATGTAATCA AGCAACTAAA AAGGAACCAG 2760 

GTTCTAAAAT CTGATTCCTT TTTTCTATGC TTTTATTCAG CCAAGGCTCC CATTGGATCC 2820 

CATGGTGCAA GTACGATTGG TTCTGCTCCA TAGGCAGCTT GTTCTTCTGC TGTCAGCAAT 2880 

TCCTTACGAA CAACGATTTG GTATGTGTAT TCGTCCATCC AAGCGTCTGA GGCAACAAAG 2940 

TAACCATCTG TACCGACCTT GTCTCCCCAT GAGTTTTCAA CCTTCCACTT GGTTGATTTA 3000 

CCATTTTCGT CCAAGTCAAC ACCTGTCAAG ACCATGGCGT GGGTCATCAA GCTTTCACTA 3060 

TAGTCCAAAC GTCCAGCCTT GTCTTGAGTA AGTTTAATGT CCATGCTTGA TTCAAAGTCA 3120 

TAAACATCTG TCGCAAGGAT GCCAGCTTAC GGTTGCTGAG CTGGCCGACA TCAGAACCAA 3180 

ACCAAACAGT CTCACCTGCT TGCATTTGGG CAATCGCCAA TTCTTTCAAG CGCTCCATTG 3240 

GAACGTTGAT GTAGCGAACT GCACGGCTAC CAACCACATT CCCCAACATC TCAACTGTGT 3300 

AAGATTTTCC GTAAGGTTTA TCAGCAGTTG GAGCATTGAT AACAGAAACG TAGTCTTCTA 3360 

AAGGAAGATT GACATATTTC TTGTAAAACT CTTGTGGTGT GATTCCTTTT TCACTTTTGT 3420 

AGTTGTTATC TTTATCGCGA TAAGCAAAGT CAAACTTGCG TGGTGGAAGT CCTAATGACA 3480 

TAGCAAGAAA GTTAAAGATT TCTTGCAAGA GGTCTTCTTT CTTAGCTTGA ACAGTCGCTT 3540 

GATCTGCACC AGAAACAAGC AAGTCACGCA AGATTTGAGC ATCTTGACGA AGCAATTTAT 3600 

TAAGGATCGC ATTTAGCTCA CGACTGCTGC TAGATGAAAC AGACTCAGGA TAAACTGACT 3660 

TAGGCACGAC ACCGTATTTT TCAAAGAGGG AAACGACCAT ATCCCATTGA CCGCCATCTT 3720 
GTTGAGGTGT TTGGAGTAAG AAGCTAACtT GCGGCTAGTC AATTCTTGGT CTGAAGTCGC 3780

AATGACTTGC TCCAAGAACC AGTTTGATTT CTCATACTTA TCCCAGAAGA AAGTGTGGGC 3840 

TTGTGACAAC TCAAAGTTCT CCAATTTGTA TTGCGAGATG AGTTTGTGGC GGAAGGTGTT 3900 

GAGAGCCGCA AACATCCAGC AACGACCAGA CGCTTTCTGG TTAGTGACCT TGTCCTTGGT 3960 

TAAATCCAAT GAGAAAACAG GTGTGTTGTC TACATGGCTT TGGCGACGTT CCAGAGCTGC 4020 

AAAAATTCCG TTGTGGCTGG CAGCATTTTC AATCGCTTGG TATTTTACAT TTGCTTCATA 4080 

GTTGGCAAAT AGTTTATCAG TAAATGATTC TTGAATCGCG TTCATAGATT CCTCCTTTTA 4140 

GTCTACAGTG TATTGG 4156 



(2) INFORMATION FOR SEQ ID NO: 212: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3902 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 212: 



AAAAACAACA AAATAAAACA AAAACAAAAA TATCGAGGTT TATTTTCAAA ACTTTCGATA 60

TTTTTATTAA GTTATTATTT TGTTGTTTCT AGTTTACTTT TTGATGGTTA AGAGTGGTGG 120

AGAATTATAC TCAATGAAAA TCAAAGAGCA AACTAGGAAG CTAGCCGCAG GCTGTACTTG 180

AGTACGGCAA GGCGAAGCTG ACGTGGTTTG AATTTGATTT TCGAAGAGTA TTAGTGCAAA 240

CCGTAGTTGT AGTCATCATC TTGCATGGCT TCAACTTCGC CAAGAAGGTA ACCATTTCCG 300

ACTTGAGAGA AGAAGTCATG GTTGGAAGTT CCTGTTGAAA TACCGTTCAT AACGATTGGG 360

TTGACATCTT CAGCTGAATC TGGGAAAAGT GGATCTTGTC CCATGTTCAT GAGAGCTTTA 420

TTGGCATTGT AGCGAAGGAA GGTTTTAACC TCTTCAGTCC AACCAACACC GTCATAAAGA 480

CTCTCTGTGT AGCCTTCTTC ATTTTCATAA AGAGTATAGA GTAGGTCGTA CATCCATTCT 540

TTGAGTTTTT CTTGCTCTTC TTCAGGTAAT TCATTGAAAC CAAGTTGGAA TTTGTAACCA 600

ATGTAGGTTC CGTGAACAGA CTCGTCACGA ATAATCAATT TAATGATTTC TGCAACGTTG 660

GCAAGTTTGT TGTTACCGAG ATAGTAGAGG GGAGTGAAGA AACCAGAGTA GAAGAGGAAG 720

GTTTCGAGGA AGACGCTGGC AACTTTCTTT TCAAGTGGGC TGCCGTTTAG GTAGATTTCG 780

TTGACAATCT CAGCCTTCTT TTGTAGGTAA GGATTGGTAT TGGTCCATTC GAAAATTTCT 840

TCAATCTCAG CCTTAGTATT CAAGGTAGAA AAGATTGATG AGTAAGATTT AGCGTGGACA 900
GATTCCATAA ATTGGATGTT ATTGAAGACA GCTTCCTCAT GTGGTGTACG GATGTCTGCG 960

CGAAGGGCTT GAACCCCAGT TTCAGATTGC ATAGTGTCAA GAAGGGTTAA ACCACCAAAA 1020 

ACTTTTCCGA CCAAGTCTTT CTCTTTGTTA GATAGCTTTC TCCAGTCATC CAAGTCGTTT 1080 

GATAAGGGAA TACGTGTATC GAGCCAAAAT TGCTCCGTCA GTTTTTCCCA AGTTGATTTG 1140 

TCGATGACAT CTTCGATGGC ATTCCAGTTA ATGGCTTTGT AGTAAGTTTC CATTTAAAAT 1200 

CTCTTTCTGT GTTTAGTATT GCGAACTCAC AATTATTTCT ACTTTACCAT AATTCTATAG 1260 

GAGTATCGCA CAAAAAGTCG GAAGCCCGAC TTTTAAAATG TTACATAAAT TATGTTATGA 1320 

CATAGTAGAT TTGATTTTAT CAGTGCTGCT TAGGGAAAAA TAGTGTTTCT ATGCTAGAAA 1380 

CTAAATCACA CAGCTTTCAC ATTGGTTGGC GCCGACTTCT CCACCGTCAT CTGTAAAGGT 1440 

ACGGACGTAG TAGATAGACT TGATTCCCTT GTTAAAGGCA TAGTTACGAA GGATGGACAA 1500 

GTCACGTGTC GTTTGTTTAT TTTCCCTCTT CCATTCGTAA AGGCCTTTTG GAATGTCACT 1560 

GCGCATGAAG AGGGTGAGTG AAAGTCCTTG ATCCACGTGT TCAGTCGCAG CAGCGTAAAC 1620 

ATCGATGACT TTACGCATAT CCATATCGTA GGCAGAAGTG TAGTAAGGAA TGGTTTCTGT 1680 

AGACAAGCCA GCAGCAGGGT AATAGATTTT ACCAATTTTC TTCTCTTGGC GTTCTTCGAT 1740 

ACGTTGCGTA ATCGGGTGGA TAGAAGCAGA AACGTCGTTG ATATAGCTGA TAGAACCATT 1800 

TGGCGCTACA GCAAGGCGAT TTTGGTGGTA AAGACCATCT TCTTGAACCT TGTCGCGAAG 1860 

TTCAGCCCAA TCAGCAACAC CAGGGATAAA GACATTTTTG AAGAGTTCTT TAACACGGTC 1920 

TGATGTTGGA ACAAATTCAC CAGTTACATA CTTGTCAAAG TAACTTCCGT TAGCATAGTC 1980 

TGATTTTTCA AAGTTGTGGA AGGTAATACC ACGTTCACGT GCAATATTGT TTGACTCTAC 2040 

CAAGGTCCAG TAGTTCATAA GCATAAAGTA GATGCTTGTA AATTCAACAG ACTCAGGTGA 2100 

ACCATATTCA ATGAGTTGTT GGGCAAGGTA GCTGTGCAGT CCCATGGCAC CGAGACCAAA 2160 

GGTGTGGGCT TGGCTATTTC CATGGTCAAT CGTTGGTACA GCTACGATAT GTGAACTATC 2220 

TGTAACGAAA GTAAGGGCAC GAACCATAGC ACGGATAGAA CGACCAAAAT CAGGTGAAGT 2280 

CATCATGTTA ACCACGTTGG TTGAACCCAG GTTACATGAA ACATCTGTTC CCATTTGAAG 2340 

GAATTCTTGA GCATCGTTGA TCAAGCTTGG TTCTTGAACT TGAAGAATCT CAGAACACAA 2400 

GTTACTCATG ATAATCTTTC CATCAACAGG ATTTGCACGG TTAGCCGTAT CGATGTTGAC 2460 

TACATAAGGA TAGCCAGACT CTTGTTGCAA TTTAGAGATT TCAGTTTCCA AATCCCGCGC 2520 

CTTGATTTTT GTCTTGCGAA TATTTGGATT TGCGACCAAT TCATCGTATT TTTCAGTAAT 2580 

GTCGATGTAA TTGAATGGCA CACCGTATTC TTTTTCTACA GAGTAAGGGC TGAAGAGGTA 2640 

CATTTCTTCA TTTTTACGAG CCAATTCGTA GAATTTATCA GGTACTACAA CACCAAGTGA 2700 
TAGAGTCTTG ACACGTACTT TTTCATCAGC GTTTTCTTTC TTAGTTGAAA GGAAAGCGAT 2760

GATATCTGGG TGAAAGACGT TGAGGTAGAC AACACCAGCA CCTTGACGTT GCCCCAATTG 2820 

GTTGGAGTAA GAGAAGCTGT CTTCAAAAAG CTTCATAACA GGAACGACAC CTGAAGCAGC 2880 

TCCTTCATAG CCTTTGATAG GTGCACCAGC TTCACGAAGG TTGCTGAGGG TAATTCCCAC 2940 

ACCACCACCA ATACGTGAAA GTTGAAGAGC TGAGTTGATA GAACGCCCGA TAGAGTTCAT 3000 

ATCATCCGTC ACTTGGATTA GGAAACAAGA TACCAACTCC CCACGACGAG CACGTCCAGC 3060 

ATTCAAGAAG GAAGGAGTAG CAGGTTGGTA GCGTTGGTGG ATGATTTCAT TGGCAATATC 3120 

GATTGCAACA GCTTCATTCC CATCAGCGAA ATAAAGGGCA TTGAAGAAGA CACGGTCTTC 3180 

CATATTTTCA AGATAGTATT CACCGTCATT AGTCTTTAAG GCATATTGAT TGTAAAATTT 3240 

ATAAGCTGCC ATGAATGACT TGAATTGGAA GTTTTGGTCT TTGATAAATT GAGCTAATTC 3300 

TTCCAAGAAC TCTGGACGGT ATTTCTTGAT AAAGGCTGTT TCGATGTAGT TGTGTTCAAT 3360 

GAGGTAATTG ATTTTGTCTT TGATTGAATC AAAAACCATA GTGTTTGGAA CTACATTTTC 3420 

TTTAAAGAAA GCATCCAAGG CTTCCTTGTC TTTATGAAGC ATGATTTGTC CATTAACAGG 3480 

ACGGTTAATT TCGTTATTAA GACGGAAGTA AGTCACGTCT TCAAGATGTT TTAATCCCAT 3540 

AAAATTTCCC TTATCTAATT ACAAAAGAAA GGCTTCTAAG TTAGCCCTAA AAGCAGTTTC 3600 

TTCTGGATGA TGTACTAAGA TTATGCTAAT TGTTTCAGTT TTCCTGGTTG GAAACCTGAA 3660

AAGACTTCAG TTGGTGTTTG GATAACAGGA GCTGCGCTAA AACCGAGCTC TTTAACTTGA 3720 

TCGACGTACT CAGGTTGCTC ATCAAGATTG ATTTCACGAT AAGAGACATT ATTACTGTCC 3780 

AAGAAACGCT TGGTCATTTT ACATTGGACA CAATTGTTTT TAGAATAAAC GGTTACCATT 3840 

GTGTAACTCC TCTTCAAAAT TTAATACTAT CTTAGTATAT CAGAAAATAA AATTTTGTCG 3900 

GG 3902
(2) INFORMATION FOR SEQ ID NO: 213: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 2456 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 213: 

TATTGAAGCT ATTGTAGACT ACAAAGATAA GGATTTGCAG TTAGTAGGCG GTGAGACTCA 60 

CTGATAACCT AAAAAGGATA GTCAATTATG CTTGTTTACT AACTATTAAC TATGCTAAAT 120 
CAATTGAGGT TGTTTACATA AAACTCTATA TCAGAGAAGC CTGATATAGA GTTTTTTCTT 180

GCTAGTTTTA GGATTTTTTT GTAAAATAGA AAAAGTGAAG AGAGGTATGA AATGAGCAAG 240

AAAGATAAAA AAATCGAAAT TCAAGTAGCG GATGCCAAAG TTAATGTTGG TAAAGACAGT 300

TTTGAAGGTT ATACATTGAC TATCGGTAAA AAAGTTATCG GAGAAATTGC CGAATTAGAC 360

GGACAATTTG CCATTATAAA GAATGGGAAT GTCGATAGTT TTTATAAAAA ATTGGAAAAA 420


GCTGTGGAAA 


TTTTGATTGA 


AAATTATAAT 


TTAGCAAAAT 


AAGTCTTGTT 


x x x x lunnnl 


A flfl 


TTTCATGATA 


TAATAGTCCA 


TGTTGATTGT 


AGGAGAGATA 


GGG AAGAGGf 


t a a be fierce 


*\A A 


GGACTGTAAA 


TCCGCCCCTT 


CGGGTTCGGG 


GG TTCGAATC 


V- V— x v~ x \— x v- x v_ 


P a TTTP & TT & 
\-t\ ill ^ A 1 1 A 


ouu 


ATGGGGTATA 


GCCAAGCGGT 


AAGGPAAGGG 


APTTTP A PTP 


PPTP ITPPPT 


1 vtvj 1 1 L(jnA i 


660 


CCAGCTACCC 


CAGTT CT T AG 


GTAATAATPA 


Ad at &rc a a&p 

nun l nun/v\u 


I* AAAA I A 1 L. i 


1 AuGG 1 Alii 


720 


TATTTTTATA 


ATTGAAAGAf* 


CZTfZ A ATY2 A T A 
V3 iunn X\zA 1 A 


1 vi AAt-A l\j 1 V_ 


PTTPfypPW 
L 1 i LjL^vjLAj L\j 


C 1 I AGGAAAA 


780 


AAATTATAAG 


TATGTCAAfiT 


JL Innun/vvui 


PTTP a TTPTT 


o*jAW»»A1 111 


ITFaGATGAAC 


840 


GAATTTGAAG 


ATTTGCTAAA 


TARPfiTTAfST 
j. iiuVfU x x nu i 


pa AP.TTP.apA 

V> AAu I IvaAuA 


PTYl PTP A TP T 


lull Av? 1 vjv, 1 


900 


GAAGTATTGA 


CAGTTGATGP 


P. A PTr A A PPT 


a a ppttpp a a 

AALu 1 IvjLArt 


tptpttpi ap 


1 t>Ca 1\» 1 TG AA 


960 


GGTGTCTTGA 


^ i v# i x v« vv»\ati 


ATTPaPA MP 
Al 1\jA^AAA(- 


f~> * TV'P , nV > ATP 
un iVu i VjA 1 Lj 


CnuA 1 A 1 V AA 


TGACTTTGTT 


1020 


AAAGTAGGAG 


AAGTATTfifiA 


TP'T"I Vl'M'PT & 
lul 1 J. lu i rv 


PTTPPTPliP 


T a pttppt a a 

1 flu 1 IvjLs 1 AA 


AOA 1 ACTGAT 


1080 


ACAGTTACAT 


ACCTTGTATC 


TAAAAAAOGP 


PTTPA Af2PTP 


PPA AAPPATP 


nps pa a ip<pp 


1140 


GTTGGTCGCG 


AAGAAGAAGT 


X w X X X w X X 


AAAfifiAAPflP 


PTPPPP/PTa a 


a pptpp & c r v r v 

nWj 1 vvjAL. 1 1 


1 OPtfk 


TCAGTAGAAT 


TTGAAGGTGT 


TCGTGGATTT 




CAATGTTGGA 




1260 


GTACGTAACG CTGAGCGTTT TGTAGGTCAA GAATTTGATA CTAAAATCAA AGAAGTTAAC 1320

GCTAAAGAAA ACCGCTTCAT CCTTTCACGT CGTGAAGTTG TTGAAGCAGC TACTGCAGCA 1380

GCTCGCGCTG AAGTATTCGG TAAATTGGCT GTTGGTGATG TTGTAACTGG TAAAGTTGCT 1440


PprpaTPAPa » 
V-ui/ii L.AV.AA 


GC-TTCGGCGC 


TTTCGTCGAC 


CTTGGTGGTG 


TTGACGGATT 


GGTTCACTTG 


1500 


ACTGAATTGT CACATGAACG TAATGTATCA CCAAAATCAG TTGTAACTGT TGGTGAAGAA 1560

ATTGAAGTGA AAATCCTTGA TCTTAACGAA GAAGAAGGAC GTGTATCACT TTCACTTAAA 1620

GCAACAGTAC CAGGACCATG GGATGGCGTT GAGCAAAAAT TGGCTAAAGG TGATGTAGTA 1680

GAAGGAACAG TTAAACGTTT GACTGACTTC GGTGCATTTG TTGAAGTATT GCCAGGTATC 1740

GATGGACTTG TTCACGTATC ACAAATTTCA CACAAACGGA TTGAAAATCC AAAAGAAGCT 1800

CTTAAAGTTG GTCAAGAAGT TCAAGTTAAA GTTCTTGAAG TTAACGCAGA TGCAGAACGC 1860

GTGTCACTTT CTATTAAAGC TCTTGAAGAA CGTCCAGCCC AAGAAGAAGG ACAAAAAGAA 1920
GAAAAACGTG CTGCTCGTCC ACGTCGTCCA AGACGTCAAG AAAAGCGTGA TTTCGAACTT 1980

CCAGAAACAC AAACAGGATT TTCAATGGCT GATTTGTTTG GTGATATCGA ACTTTAATCA 2040 

AATTGAAAAT TCACAAAATC CTTTGTTTAC TAAACAAGGG ATTTTTCTGG CTCTTTGTCA 2100 

ACTGTAGTGG GTTGAAGAAA AGCTAAGCTC GAGAAAGGAC AAATTTTGTC CTTTCTTTTT 2160 

TGATATTCAG AGCGATAAAA ATCCGTTTTT TGAAGTTTTC AAAGTTCCGA AAACCAAAGG 2220 

CATTGCGCTT GATAAGTTTG ATGAGATTAT TGGTCGCTTC CAGTTTGGCG TTAGAATAGT 2280 

GTAGTTGAAG GGTGTTGACA AGCTTTTCTT TATCTTTGAG GAAGGTTTTA AAGACAGTCT 2340 

GAAAAATAGG ATGAACCTGC TTAAGATTGT CCTCAATAAG TCCGAAAAAT TTCTCCGGTT 2400 

CCTTATTCTG AAAGTGAAAC AGCAAGAGTT GATAGAGCTG ATAGTGGTGT TTCAGG 2456 



(2) INFORMATION FOR SEQ ID NO: 214: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10974 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 214: 

AAATAGGATA TAGAGACATC CTTCTGATCT GCTTTTwACA AAGTCCAATT ATATGCGGAT 60 

CTATACCTCC ACAATGTCCA TTATTATmCC TAACTATAAT ATGAGCCGAA AACACTATAT 120 

CCTTAATGTC TCCATATCCA TCAGGGATAT TAATATTTAT TTTTCCACAA CTATATTGCA 180 

TTGTAACCAT CTCCTTAAAC GACGCATTAT GATATTTGAT AGAGAAATTT TTATGAATAA 240 

CTCAATAATT TTATAGTAAA TCATGCTTAT ATCTCAAAGA TACCTATTTT ATCTTGTCTC 300 

GACCTTCTCC AAAGAATTGC TATAATACTA TTACAAATCC ATCTGCACTA CACTTCAAAT 360 

TTTAGCACTG TATAAAAACG TTTCAATACA CTAACTTCAA GAAAACTTCC ACTATTAATT 420 

GAAAAAATTG ATAGAGATAA ATTAAAAATC TATATTGAAA CTCATCCCGA TGCTTATTTG 480 

ACTGAAATAG CTGCTGAATT CAACTGTCCT CCAACAACTA TTCATTACGC TCTAAAGGCT 540 

ATGGGATATA GTCTAAAAAA GAGCCGTACC TACTGCGAAC AAGACCCAGA AAAAGTAAAT 600 

CGGTTCCTTA AAGAATTGAA TCACTTAAGC TACCTGACTC CTATTTATAT TTATGAGACA 660 

GGGGTTGAGA CCTATTTTTA TCTCGAATAT GATCGAGCCT TGAGCAGGCA GTTAGTCTCT 720 

CTGGAAGAAG ATATAATTAT TTGAATTAAG ATCGAGACAA CGCACACCAG AGATTGCGAT 780 

ACTGTTATAG AAGTACTAAT GCCCTTTTTT GTTTCAATAT ACTATGGCTC CGATGACCTA 840 
TAAAGATACG ATGACGAGTG ACTTTTTCGA AGCTTGCTTC CAAAAATTCT TACTACCTAC 900

TTTAGATACA CCATCCCTTA TCATTATGGA CAATGCAAGG TTTCACAGAA TGAACATGTG 960 

TAAGGAGCAG GGCATAGACT GTTACCACTT CCTACCTATT CACCCGAGTA TAATCCCATT 1020 

GAGAAAATAT GGGCTTACAT CAAAAACATC TCAGAATAAT ATTGTCAAAT TACGATGCTT 1080 

TTCTTGAGGC ACTTTTGTCC TATTCTTGTT TCAGCCGACT ATACTCCGTT ATTGGGCAGC 1140 

TACGGAACAG TCGATGGGAC GATGGGGGGA CATAAAAAAA TCCTCCAGTT TTGTTTTTTA 1200 

TAACAGTATA CTGGAGAATT GACAATCTCG GTAGATACCT CGTTATAGCG CGGTTACTTA 1260 

TTAGGCAGTT ACAAAACAAC TGTGAACAGA AAACATTCCA GAGTCAGACA AGACTTTGGA 1320 

ATGTTTTGGC TCTATAATTT CTGTAGTGGG TAATCCCACC CCAGGAATTA TAGGGTCGTT 1380 

TCTTGTAGAA AAAAAGCCCC ATATGACCTA TAATGAAAAG CGTCTAACCA ACTCATTAGA 1440 

AAGGGTTCAT ATGGAACAAC TTAAGAATAC CACAGATTTG CTCGGATTGG AAGACAAAAA 1500 

TATCAAAATC TTGTCTGTTC TGAAATACCA AACCCATCTA GTCGTTCAGG CAAAGTTGGA 1560 

TTCCCCCGCT CCTCCTTGTC CTCATTGTCA AGGGAAGATG ATCAAATACG ACTTCCAGAA 1620 

AGCCTCTAAA ATTCCGCTTC TCGACTGTCA GGGTTTACCC ACGGTACTGC ATCTCAAAAA 1680 

GCGCCGCTTT CAGTGCAAGA ATTGCCTTAA GGTGGTCGTT TCTCAAACAT CCATTGTCAA 1740 

GAAAAATTGC CAGATTTCCA ACATGGTGAG ACAAAAAATC GCTCAGCTCC TCCTTGAAAA 1800 

GCAGTCTATG ACTGAGATTG CCCACAGATT GGCGGTCTCA ACTTCCACCG TCATCCGAAA 1860 

ACTGAGGGAA TTTAAGTTTG AAACCGATTG GACCAAGTTG CCAAAAGTTA TGAGTTGGGA 1920 

TGAGTATAGC TTCAAAAAGA GCAAAATGAG CTTCATTGCC CAAGATTTTG AGTCCAAATC 1980 

CATCCTCGCA ATTTTAGACG GGCGAACTCA TGCGGTGATT CGAAACCATT TCCAACGCTA 2040 

TCAGAGAGAG GTTCGGGAGC TGGTCGAGGT CATCACCATG GACATGTACA GCCCTTATTA 2100 

TCGGCTCGCT AAGCAACTCT TTCCAAAGGC GAAGATTGTT CTTGACCGCT TCCACATTGT 2160 

CCAACATCTG AGCCGAGCTA TGAACCGAGT ACGAATCCAA ATCATGAACC AATTTGACCG 2220 

AAAATCCTTG GAGTATCGGG CGCTCAAGCG CTTTTGGAAC CCTCGCTTTT TCGTTTCTAG 2280 

GCTCGGGCTA AATCAGTCCA CTGGACTGAT TTACTACACC AGTATAGCTT CAAGCTCTGT 2340 

CAGAAACGAT TCTATCAGCC CACGTTTCGA ATGCACTTAA CCCATCGGGA AGTACGAGAT 2400 

AAGCTGCTTT CTTACTCTGA GGGATTACAG GTTCACTACG AACTCTATCA ACTCCTGCTC 2460 

TTTCATTTTC AAGAGAAGAA TGCCGACCAT TTCTTTGGAT TGATTGAGCA AGAACTGCCA 2520 

ACGGTTCATC CGCTTTTTCA AACGGTCTTT TGGACTTTTT TAAGGGATAG AGATAAGATT 2580 

ATCAACGCAC TTAAGCTGCC TTATTCCAAC GCTAAACTTG AAGCGACCAA TAATTTGATT 2640 
AAGATTATCA AGCGCAAAGC CTTTGGTTTC CGGAACTTTA ACAATTTTAA AAAACGGATT 2700 

TTGATGACTT TGAACATCAA AAAAGAGAGT ACGAATTTCG TACTCTCCAG ATTGCAGCTT 2760 

TTCGCCTACC CACTACACTT GACAAAGAGC CACTCTTTAT TCCATGGTAT CAAAGGCAAG 2820 

ACTTGGTTTG GCATTGAGGT CCCAGCCTGC GAAGTTTTCT TTGTTCCACT CGCTGACGCT 2880 

GGCATAGGCA ATCATACCTG CATTGTCTCC GCAGAGTCGC AGAGGGGGGA TGATAACCTT 2940 

GACATCTGTG ATTTCGGCTG CTAGGCGTTC TCTGAGACCT TTATTGGCTG CCACACCACC 3000 

TGCCACAACT AGGATTTTAA CAGGATATTT CTCCAAAGCC TTCTTGGTTT TTGCCATGAG 3060 

AATGTCCATA ACTGCTGCTT GGAAGGAAGC ACACAAATCT TCTGTAGACA GGCTTTCTCC 3120 

CTTTTGCTCG GCATTGTGAT GAAGATTGAT AAAGGCAGAT TTCAAACCTG AGAAGGAGAA 3180 

CTCCAGATTA TCTTCCTTAA TCATGGCACG GGGGAAATCA TAAATATCCT GCCCCTGATG 3240 

AGCCAGCTCG TCAATCTCAC GACCTGCAGG ATAGGTCAAG CCCATGACAC GGCCGACCTT 3300 

ATCATAAGCC TCACCAACCG CATCATCACG GGTTTCCCCA ACAATCTTAT AATCTCCTGC 3360 

CTCCGAAACA TAAACCAACT CTGTGTGTCC GCCGCTGACC AAGAGGGCTA GCAAGGGAAA 3420 

CTCCAAAGGC TCCACACTCT GAGCTGCCAT GAGGTGCCCA GCCATGTGAT TAACAGGAAT 3480 

CAGTGGAAGT CCGTGAGCCC AAGCAAAGGC CTTGGCAGCT GACAAACCAA CTAGCAAGGC 3540 

TCCGACCAAG CCTGGTCCGT AGGTAACCGC AACAGCTGTC ACGTCCTCTT CGGTAATCCC 3600 

TGCTTCTGCC AATGCCTCCT CGATACAGGC TGTAATGACC TCGACATGGT GACGACTGGC 3660 

TACTTCGGGC ACTACGCCAC CAAAACGTTT GTGACTCTCA ATTTGACTAG CAATGACATT 3720 

GGACAAGAGC TCATCGTCGT TTTTCAAGAC GGCGACACTG GTCTCATCAC AGGATGTCTC 3780 

AAATGCTAAA ATATATCTAT CCTTCATCTA TTTCTCTCTT CATGATAATG GCGTCCTCGA 3840 

CTGGGTCATG GTAGTAGGCC TTTCGCTCAG CGATAACTGT CATCTTTTCT TTCTTGTAAA 3900 

ATGCTTGCGC TCGTTGATTT GACTGTCTGA CTTCGAGGAA AATTTCCTTG TCTGTCGGCA 3960 

ATTGAGCAAA CAAGGcTGAC GCAATCCCCT GACCCTGATA AGCTCCTTTG ACAGCGATTT 4020 

GCAGGACTTC TGCTTCAAAA AGATTCTCCT GCACAGCTAG AAATCCAATC ACTTCTGCCC 4080 

CATCATAAGC CAATGCATAC CAAGTCTGGT CTTGGGACAG ATCTGCTTGG ATTTGCTCCA 4140 

GAGTCCAAGG ACTGACTAGG TAAACAGCTG CCATAACAGC GTAGATGGCT TGAGCTAGGT 4200 

CAGGCTGTTG TTGAATTCGC TTGATTTCTA TCATAGGCGT TTAATGTAAG ACTCGCCAGA 4260 

CTCGGTATGG TTCTTGAGCC AGTTTTCCTC AGCCTCGACT CGTTTGAGGT AATTCGGCAC 4320 

AAAATCATGC AAGGAGTCTG CTTCCTTGTC CCAGGCCAAA AGAGCTAGAT TAGCTGCATT 4380 
GGGCAATGTT TCTTTGTAAT CAGTCCTTGG CAAGTGTTTT TGAATCTGCT CAACAAAGGG 4440 

GCCAACTTCT CCGACAAAGG TTACCTGACT AGTACCCTTG ACTTTTTCTA GCACCTCTTC 4500 

AAAAGATAGG TGCGCTTCTG CCATGACAGG TTTGGCATTT TCATAAAATC CTGCATAAAC 4560 

ATTATTGCGA CGCGCATCCA TCAAGGGGAC AAACAAACCT TCTTGTTGAT GGGGCACCAG 4620 

AGCCAAGAGA CTCGACATAC CAACCAACTC GATGTTCAGG GTGTGAGCTA AGGTCTTAGC 4680 

AGTTGCTACC GCAATTCGCA AGCCTGTATA GCTACCCGGC CCTTCAGCTA CCACGATTCG 4740 

GTCCAAATCC TTGGGTGTCC AATCCAAACT TGCCATCAAA AAATCGATGG CAGGCATAAG 4800 

AGTAATACTG TGATTTTTCT TAATATTAAT CGTCGTCTCG GCAAGAACCT GCTTATCCTC 4860 

TAAAATAGCC AGAGAAAGAG CCTTGCTGGA CGTATCAAAA GCTAATACTT TCATAACACA 4920 

TTCCTATCTT TTTGTCTGCT TACTATTATA CTACAAAAGC TGGCACATGG GAATTTTCTT 4980 

TGCCCCCAGA CAAGAGTGCC CTCACTTAAC TAAAAATAAT TTAAAAAAAT GCTCACTTTT 5040 

CCTTTTCTTT TCCGAATATA AAAGTGAACA AGAAAAAAGG AGGAAAGTTC AATOAPAAAT 5100 

TTTGACATTC TTGACAATCA ATTTTTATCC TTATCTGAAA ATGAATTATC AGATATTGAT 5160 

GGCGGTCTCG CTCCCTTGGT TATCTTTGGA GTAGCAGTAT CTTGGAAGGC TATTGCAGGT 5220 

GGAACAGCAC TTATAGGTTC TGGTTTGGCA GCTGGTTATT TTTTAGGAGG AGATTAATAT 5280 

GATGAAAGAT TTGAACAATT ATCGTGAAAT TTCTAATAAG GAATTGCAAG AAATCAAGGG 5340 

TGGCTTTGGT GTCGGTGTTG GTATCGCTTT ATTTATGGCA GGTTATACCA TTGGAAAAGA 5400 

CCTTCGTAAA AAGTTTGGTA AGTCATGCTA GATAAGAAAC ACATTTTTAG AAGGATAAAT 5460 

TTTATTGTCT TCATCTCTTA CAGTTTGCTC AGCATTCTCA ATGATTTGAA CATTACTACC 5520 

ATCCCTTTAC CATTCGATTT ATCTGTTTGT ATTGTTTTAT TTTTATGCTT CAACTCTATT 5580 

TTTGATCAGA ACAATGACTC CCATAAAAAT AATAAGCTTT GAAAATTCCA TTGTCATGTC 5640 

ATGTTAGAAA AATGCAAAGA CCACCTCATC TTGATAGATG GGGTGGAATT TTCGTGTCGT 5700 

AAATCTACTA TCTCTACATT CCCAAACAAA AAACCCCAGC ATAAGCAGGG CATCTAAGCA 5760 

TTTAATTCAA AGTAAAATAC AAACCAAACG ACATAGGTCA CGAGGAGGAG AAAAAGCGAG 5820 

TAGAGAGTCA CAAAGGTCAT TTTCCACAAG AACTTGGTTT GTCGTCGTTC CAGTTTGGCA 5880 

AATAGAAGAT TCCCCGCATA AACGCAAGCA ACAAAAACAA TAAAAGCTAC CAAGCGAGCT 5940 

CCGATAGCAA AAGCAAATAA GTTATACATA GGGCAACCTC CTTGACTTAA AATCTATATG 6000 

GAATTATGAC AAGCAATAAA TTTCACTTCC GTTATCAACA TAATACATTT TCTTTATTTT 6060 

TGAAAACGCT TACCAAAGAA ATCGTCCCCT AACTTTCTCG TTTCCGTCTT TTACTAATTT 6120 

TTCATTTTGT GGTATAATTG AAATAATTGT AACGAATCAA GGTCAATCTA GACACAAAAT 6180 

GGAATGAAAT CAAGCAAATA TCTGCTAAAA GTTTGGAATA AGCTGACCTG TAAATAGAAA 6240 

GGAACTATAT GATTTACAAA GTTTTTTATC AAGAAACAAA AGAACGTAGC CCACGCCGTG 6300 

AAACAACACG CACGCTTTAC CTAGACATCG ATGCCAGCTC AGAACTTGAG GGCCGTATCA 6360 

CTGCTCGCCA ACTTGTCGAA GAAAATCGCC CAGAGTACAA TATCGAGTAT ATCGAACTCT 6420 

TGTCTGACAA ATTGCTCGAT TACGAAAAAG AAACTGGCGC CTTCGAAATT ACGGAGTTCT 6480 

AATATGGCCT ACACTCTTAA ACCTGAAGAA GTCGGCGTTT TTGCCATCGG TGGTCTAGGA 6540 

GAAATCGGGA AAAACACTTA CGGAATTGAA TACCAAGACG AGATTATCAT CGTCGATGCT 6600 

GGGATTAAAT TCCCAGAAGA TGACTTGCTT GGTATCGACT ATGTCATTCC TGACTACTCT 6660 

TACATCGTGG ACAATATCGA CCGCGTCAAG GCTGTTTTAA TCACACACGG ACACGAGGAC 6720 

CACATTGGTG GGATTCCGTT CCTACTCAAG CAAGCAAATG TCCCTATTTA TGCTGGACCG 6780 

CTTGCCTTGG CTTTGATCCG TGGGAAACTC GAAGAACACG GCCTCTTGCG CAACGCCAAA 6840 

CTTTACGAAA TCAACCACAA CACCGAGTTG ACCTTTAAAA ATCTCAAGGC AACTTTCTTT 6900 

AGAACGACTC ACTCTATTCC AGAGCCTTTG GGGATTGTCA TTCATACTCC TCAAGGGAAA 6960 

ATCGTCTGTA CGGGTGACTT TAAGTTCGAC TTTACTCCAG TTGGAGAACC TGCGGACTTG 7020 

CATCGTATGG CTGCGCTTGG TGAAGAAGGC GTGCTCTGTC TCCTGTCTGA CTCGACAAAT 7080 

GCGGAAGTAC CAACCTTTAC CAACTCTGAA AAAGTCGTTG GTCAGTCCAT TATGAAGATT 7140 

ATCCAAGGTA TTGAAGGACG TATCATCTTT GCATCCTTTG CCTCAAATAT CTTCCGTCTC 7200 

CAGCAGGCAA CAGAAGCTGC TGTTAAGACT GGACGCAAGA TTGCGGTCTT TGGTCGTTCT 7260 

ATGGAAAAGG CCATTGTCAA CGGAATCGAT CTTGGCTACA TCAAAGCTCC TAAGGGAACC 7320 

TTTATCGAGC CAAATGAAAT CAAAGATTAT CCTGCAGGAG AAGTTCTTAT CCTCTGTACA 7380 

GGTAGTCAGG GTGAGCCTAT GGCAGCCCTC TCTCGTATCG CCAACGGAAC CCACCGTCAA 7440 

GTACAATTAC AACCAGGTGA TACCGTTATC TTCTCTTCTA GTCCCATCCC TGGAAACACT 7500 

ACTAGTGTCA ACAAGCTGAT TAACATCATT TCTGAAGCTG GTGTCGAAGT TATCCACGGT 7560 

AAAGTGAACA ATATCCATAC ATCTGGACAC GGTGGTCAGC AAGAGCAAAA ACTCATGCTC 7620 

TGCTTGATTA AGCCAAAATA CTTCATGCCT GTCCACGGTG AATACCGCAT GCAAAAAGTC 7680 

CACGCTGGAC TAGCAGTGGA TACTGGTGTT GAGAAGGACA ATATCTTTAT CATGAGCAAT 7740 

GGCGATGTGC TTGCCCTTAC TGCTGACTCA GCTCGTATCG CAGGTCATTT CAACGCCCAA 7800 

GATATCTATG TCGATGGAAA TCGTATCGGT GAAATTGGCG CAGCTGTCCT CAAAGATCGT 7860 

CGCGATCTAT CTGAAGACGG TGTCGTTCTG GCAGTTGCAA CTGTTGACTT CAAATCGCAG 7920 
ATGATTCTAT CTGGTCCAGA CATCCTCAGC CGAGGCTTTG TCTACATGAG AGAGTCTGGC 7980 

GACTTGATTC GCCAAAGCCA GCGTATCCTC TTCAATGCCA TTCGTATCGC ACTGAAAAAT 8040 

AAGGATGCTA GCGTGCAATC TGTCAATGGT GCCATTGTCA ACGCTATTCG CCCCTTCCTC 8100 

TATGAAAATA CCGAACGTGA ACCGATCATC ATCCCGATGA TCCTCACACC AGATGAAGAA 8160 

TAAAGCAAGA AAACAGCCCC GTCCTCGGAG CTGTTTTTCT CTATGCTTTC TTTTGAGATT 8220 

AAAACTCATA CTCAATGAAA ATCAAAGAGC AAACTAGGAA GCTAGCCGTA GGTTGCTCAA 8280 

AGCACTGCTT TGAGGTTGTA GATAGAACTG ACGAAGTCAG TAGCCATACC TACGGCAAGG 8340 

CGACGTTGAC GCGGTTTGAA GAGATTTTCG AAGAGTATCA ATAAAAATCG AAATCAGACT 8400 

AGAAGGCTAA GCGAAAGCAT AACTTGAGTT AGCTCCCATA GTTCGGGAAA CTATGGGAGG 8460 

CTGGAGATGA ATCAAAGCCA AGCTTTGAAC TCATTCGTAA GAAGCCGACG ACGTATCATT 8520 

TTGATTTTTG AAGAGTTTTA GAAATACTAC GATTTTTACC TTCCAGATAC ACCATCAAAA 8580 

TAGAAATATC TGCTGGGTTT ACTCCCGAAA TACGGCTGGC TTGGCCGATG GTTTCTGGAT 8640 

TGATGAGTTT GAACTTCTGA CGGGCTTCGG TTGCGATAGA ATCAATGTCA TCCCAGTCGA 8700 

TATTGGCCGG AATGCGTTTT TCTTCCATGC GTTTCATCTT GGCAACCTGG TCCATGGCTT 8760 

TGGAAATATA GCCTTCATAC TTGATTTCTG TTTCAATCAA TTCGATAATC TTGTCATCCA 8820 

AGTCTTCTGC AGCTGGTCCG ATGAAGGCCA CCACATCTTG GTAAGAAACT TCTGGACGGC 8880 

GAAGGAATTC CTTGGCTGTC ACTGCATCGG TCAAGGGTTT GAAGCCCATC TCCTCAACCT 8940 

TGGCATTGGT TTCCTTGACT GGCTTGAGTT TGATACTGTC TAGGCGCTTC ATCTCATTAT 9000 

CAAATTGATT TTTCTTGATT TCAAAACGAG CCCAGCGTTC ATCGTCCACA AGGCCAATCT 9060 

CGCGTCCCAT CTCAGTCAAG CGCATATCAG CATTGTCATG ACGAAGAATG AGACGGTATT 9120 

CAGCACGACT GGTCAAGAGA CGGTAGGGTT CAATGGTTCC CTTGGTCACC AAGTCGTCGA 9180 

TCATCACCCC GATATAACCA TCACTGCGCT TCAAAATCAA TTCAGGCTTG CCTTGGATTT 9240 

TCAGAGCCGC ATTGATACCC GCGATAATCC CTTGGCCTGC TGCCTCTTCG TAACCTGATG 9300 

TTCCATTTGT CTGACCAGCA GTGAAGAGAC CTGAGATTTT CTTGGTTTCC AAAGTCGCAC 9360 

GCAACTGATG AGGCAAGACC ATATCATACT CAATAGCATA ACCTGTCCGC ATCATCTCTG 9420 

CATTTTCCAA ACCTTTGATG GAATGCACCA AGTCACGCTG GACATCCTCA GGCAGACTGG 9480 

TTGAAAGTCC TTGCACATAG ACTTCCTCAG TATTGCGCCC TTCTGGCTCA AGGAAGAGTT 9540 

GGTGACGTTC CTTGTCCGCA AAGCGCACAA TCTTGTCTTC AATCGACGGA CAGTAACGAG 9600 

GCCCCACTCC CTTGACCACA CCTGTAAACA TAGGCGCACG GTGGAGGTTG TTTTGGATAA 9660 

TCTCATGACT GGTACCATTG GTATAGGTCA ACCAGCATGG TACTTGGTCC TTGACATAAT 9720 
CCTCATCACG TGAAGTGTAT GAGAAATGAT TAGGCACTTC GTCTCCTGGC TGAATTTCTG 9780 

TCACATCGTA ATTGATAGAA GAAGCCTTGA CACGTGGAGG GGTTCCTGTC TTGAAACGAC 9840 

CGATTTCGAG ACCCAGTTCC TTGAGATTGT CAGCTAGGTT AATAGAAGCC AAGCTGTGGT 9900 

TAGGACCTGA TGAGTACTTG AGGTCTCCGA TGATAATTTC CCCACGGAGA GCAGTCCCTG 9960 

TCGTCACAAT AACAGCCTTA GCAGCATATT CTTGATGGGT GGCTGTACGC ACACCGACAA 10020 

CCTTGCCATC TTCCACCAAA ATCTCATCAA TCATGGTTTG ACGAAGGGTC AGATTTTCTT 10080 

GGTTTTCAAC CGTCTTGCGC ATCTCCTTAG AGTAAAGTTC CTTGTCAGCC TGCGCACGAA 10140 

GGGCACGGAC AGCTGGCCCC TTCCCTGTGT TTAGCATCTT CATCTGGATG TAAGTCTTGT 10200 

CAATGGTTTT GGCCATCTCG CCACCGAGGG CATCGACTTC ACGCACGACA ATCCCCTTGG 10260 

CAGAACCACC GATAGAGGGA TTACAAGGCA TGAAAGCCAG CATTTCAATA TTGATGGTCG 10320 

CAAGCAGGAC CTTACAGCCC ATACGGCTAG CGGCCAAGGA AGCCTCAACC CCAGCGTGTC 10380 

CCGCACCAAT TACAATAATA TCGTATTCTT CAGTAAAATG ATAAGTCATG TTTCTCTCCT 10440 

ATTCCTCAAG ATGAATGTGT CTTAGTTGGC CTTCCCAATC TGGTAGGGCT GTTTTTAAAA 10500 

AGACTGGAAC TAGCTGGATA TTCTGGAGCT TATCCAAGTC AATCCACTCA CAGGGCTGCC 10560 

TTTTCTCATC TTCCTGCATG GTCAACGGGG CATCTTCAAG CAAATCCACC AGATAATGAA 10620 

ACTCGATATT GTGATAGGAA ACGCCGTCCA CTTCAAAACG ATTTTCAACC ACAAAAGCTA 10680 

GCTGCCCAGC TTGAGCTTTG ACACCCAGTT CTTCCTTCAC TTCACGGACT ACCGCGTCTT 10740 

CCGTGCTTTC ATTGACTTGA ATCGCACCTC CAATAGTGTA ATACTTGCCC TTGTCTTTGG 10800 

TAACTAGAAG CTTGTGATTT TGGACAATCA AGGCTGTAGC CCGAACACCA AAAACCGTAT 10860 

TGTCTACTTT TGTCCGAAAG TCTTGTTGAG TCATTCTTGT CCTTTCCCTT AAACGACACA 10920 

AAAACAGTCA AAACTACAAA GAAGTGCAGG ACAAAAAAGC CTGCAACATC CAGG 10974 



(2) INFORMATION FOR SEQ ID NO: 215: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 987 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 215: 
CCCGTTATGA TTATGGATAG CGCTTTCAAA TTTTTAAACT CCTATCCCAT CCTTTTATCT 60 
ATATAATAAG TGAAAATATA ATAACTGTCA AGTAACTGAA GTGAATTTTA TAAAAAAATT 120 
ACAAGCCAAA TTTGTAAAGT TTACACTAAG CCGCTAGgCA ATCGTCTATC AGAATATCCG 180 

TTTATTTGTC AATAATCCGA GAAAATCTTG CAACGCTTAG AAGTCTATAA AAACTATCAA 240 

CATTTATATG ACTTGCGAAT AGCAATCCTG CTAAACCTTT CCACACTCTA TCTATACAAT 300 

CAAGATAAAA ACATGTGTAA GCAAATCTGC TACACTTTAC TGGAGGACGC CAAGAATAAG 360 

AAAAGCTACG ATAGGCTTGC TATCTGCTAT GTCCGTATTG GGATTTGTAC AGACGATTCT 420 

AAACTTATCC AAAAAGGGTT CTCCCTTCTG GAGCTGACCG AGGAAACTTC TATGCTGTCT 480 

CATCTCAAAA AAGAAGTAGA GACCCATTAT CAACCAAAGA AATTATAAAA AAAGTCGAGG 540 

GAGCTCCTCG ACCTTTTCAT AGAATCGCCG AACGATTTAA CGAGAAAGTA TGACTTTTAC 600 

GTTTATCCCA ACTCAATTAT GACATTTTTT TCAAAAGTCA ATATATCTCA CTTTTTCAAC 660 

GACAAGAAAG AGGCTGATAA TCTACCAACC TCTTATTCTG AACCCATCAC TCCATCACTT 720 

TTTAGCTTCA TTCGCTTTCT TAGCGACTGC AATCTGGTAT TCGACTTGGT CATTCCCCTT 780 

ACCGGTACAA CCATGAGCAA TTGTAGTCGC TCCTATCTGA TGCGCTATTT CAACCAATTT 840 

TTTAGAAATC AGAGGGCGGC TCAAGGCAGA TACCAAGAGA TACTTTTGTT CATAATAGGC 900 

ATGTGACTGA TGAGCCACTA GCACATAATC TGTAGCAAAT TCGTCCTTAA CATCAATGAC 960 

ATAAGATTCT ACTGCCCAAA CCTTAAG 987 



(2) INFORMATION FOR SEQ ID NO: 216: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2651 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 216: 

CTGGGTCTTG TTCATAGTAG GTGTGGTtCT TTTTTTCGAG TGTAGCCCAT AGCTTTGAGC 60 

GCATAGTGGA TGGTAGTTGG ATGACAGCCA AAGTCAGAAG CTATTTCAGT CAAATAAGCA 120 

TCTGGATTGT CAGTAAGATA GTTTTTAAGT CTATCTCTAT CAACTTTTCT TGGTTTTGTT 180 

CCTTTTACTT GGTGGTTTAG CTCTCCTGTT TTCTCTTTTA GCTTTAACCA GCCATAAATG 240 

GTATTACGTG AGATTTGGAA AACGTGTGAT GCTTCTGTTA TACTACCTAT TCGCTCACAA 300 

TAAGAGAGAA CTTTTTTACG AAAATCTATT GAATATGCCA TAAGAAGATT ATACCACATT 360 

GTGTACTATT TTTGGTTCAT TTTACTATAT TTTATAAGTT ATAGTGTAGC ATTCCAACTT 420 

CAAAGCACTA TAAAGTAAAT TGAAACAAGA ACAATACAAA CAATTCTCGT AAACGGATTG 480 

CAACCACAAA AAAGCAAGCA TTCACAAGAA TACTTACCTA TCATGGGAGG AACAACCGTT 540 
CCTCTTTTTT ATTACTAAAA TTCAAAGAAT TCCAATGCTT TTTTCAAGAG CAAATCCGTA 600 

TATTCTGGAT CTTCTTGGGC TACTTCTATT TCCCGCTGAA CTTTTTCCAA ATCATCTGTA 660 

ATCACTCCAT CTACTCCTAA GTGAAGAGAT TTGCTGATAG CTTCTGAATC ATTGACAGTC 720 

CAGACATAAA GTTTCTGATC CGTTGTCCAT AGTTTGCTTA CAAAATATTC ATCCAAGGTT 780 

GAGTACTCCA TAGTATATCC TGTCGCTCTT GTTTTAGGAA AGACAGAATT GTAGGGCATG 840 

ATGAAATAAA CTGGTAGTTC GGCATCATAC TGTCTTACTT TTTCGACAAC ATGGTAGTCT 900 

AAAGACTGGA TTTGATGTCC ATAAATCTTG AGCTTTGCAG CATAACGGGC TAAAAAGCGG 960 

TTCATCATGT CTGGACTATC TTTTTTACTG GTTTTAATTT CAATTAGTAA TTTTTGACCA 1020 

AGTTCGTTGG CTCGACTGAG ATAATCTTCA AAGCTTGAAA TTTTAGTCTG GTAGCCATTT 1080 

TCAAAAATAT CAATCCCTTT AAGCTCCTCC AAGTTTAAGT CTTGAGGACT TTTATTGATA 1140 

CCTGCTAGAT TTTTCAAGTT AGCATCATGC ATCATGACAA ACTGCCCATC TTTTGTTTCC 1200 

TGCACGTCCG TCTCCACCAA GTCTGGTTTG AGTTGTGCTG TAGTTTCCAA GGACTCTACT 1260 

GTATTTTGAA TCCCATTTGC ATTGGAAACC CCTCGGTGAG AAATAAGTTG AGGTAGATGA 1320 

ACCATGGGAG CCTCCAGATA AATATAACCT TCTAAGGCAA AGAAAAGACT GGCACAAGTC 1380 

ATGACACCCC ATCGCACGAT GTGATCTTTT TCTCTCCTAG GAAGCATATC CAGCTCCTTT 1440 

CCTGTCAAAA ATGAAACAAA TTTAACCAAA AAATAAGTCA GAGCCATATA ATAGAGATTT 1500 

TTAATCACGA CAAAATTCAA AATACCAAGA ATCAGAGACT CTCTCTGAGT GATATCATCT 1560 

ACCAAAGTTT GAGCCAATAA TAAAGGAATC AAAGGAAGAT ArAATAATAA ATGTGCTTTG 1620 

AGCAAGATGT AAAATAAATT CCAAGCATAA AAAGTAACTC TCTTCTTGGT TTTCTCCAAG 1680 

CTAAACATCA CTGCTTCTCG AACAGTCAGC TGATCATATA CAATCTTCGG AAGGGCAAAC 1740 

ATCAATCTGA CAGAGACATA GAGAAAGATA AGAGATAGAA GTAGGATGCT CAGCCACCAC 1800 

ATCCAATATC TATCTTCTAA ATAAGCTTGG ATAAACTCTG GAATGACGAT TTTATTAAGA 1860 

TAATAAATCT TCAGCATTTT CCGTATAAAA GGAAACAGCA TAGCTATATA GAAAAAGATA 1920 

AACAAGGCTT TAGCGCAAGT TAGCTTTTTC ATAAATCCAA AACTTTCATG GAAAACCTTG 1980 

CGGATATACT CAATTAGCCT TCGCTTTTCA TTATAGAGGA GATGACGAGC ACCAATAAAG 2040 

AGGAGTCCTA TTTGAAAATA AGCAACCAGA AGGTTAATTA CAATCAAGGC TAAAAAAGCT 2100 

AGACTAATCA ATGGAGAATG AGTAAGGATG GCTAAGACAT TGTTATAGGA AATAAAAAGA 2160 

TAACCTGTCT GATCTAATAA GAAGCTAGCC AACCATGAAT TGAATGGTAC CCACAAATAC 2220 

TCCACTATCA TAAAAATCAA GAAAAATAGA AAGAGGATTT TATCAAGATC GAGGTAAATC 2280 
TGTTTAAGAC CCAATTTTTT AGGTTTTTCA GGTTTCATAG GCACTCCTAG TCAAATAATT 2340 

GAGACAAGTC CAAGCCACCA AAAGGATTGT TTGATAAGCT ACTTTCTGTC TCTAACAATT 2400 

CCCTAGCTTG ATCCGACTCT AAGAAGGATT CGTAAACACG CGCCGTCATC CGAGCATCCT 2460 

CTAAACTATT ATGAGACTGA CCTTGAAATC CAAGAAATGA GGCAACAGTT TGCAATTTGA 2520 

GATTGGCAAT ACCATGTAAA TCTGAACTCC GACGTTCAAA AGCTTCATCA TACAAATCCA 2580 

CCTTGTACTG TTGGCTATAG TCTAAACCAT GCTCTGCTAA AATAGGTAAA TCACTTTTAG 2640 
CAGCATTGTA G 2651 

(2) INFORMATION FOR SEQ ID NO: 217: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5638 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 217: 

CGTTATAATA AACTTGTGAA AAAATTAACA AAGGATATCG TTCCTTGAAA GCTATGGAGG 60 
AAAATATGGC TGATAAAAAA ACTGTGACAC CAGAGGAAAA GAAACTCGTT GCTGAAAAAC 120 
ACGTAGATGA GTTGGTTCAA AAAGCTCTAG TTGCCCTTGA AGAAATGCGT AAATTGGATC 180 
AAGAACAAGT TGACTACATC GTTGCCAAAG CATCAGTAGC AGCTTTGGAT GCCCACGGAG 240 
AATTGGCTTT ACATGCCTTT GAAGAAACAG GACGTGGTGT ATTTGAAGAC AAAGCAACTA 300 
AGAACTTGTT TGCCTGTGAA CACGTAGTAA ACAACATGCG CCACACTAAG ACAGTTGGCG 360 
TTATCGAAGA AGACGATGTA ACAGGATTGA CTCTTATTGC TGAACCAGTT GGTGTTGTTT 420 
GTGGTATTAC TCCAACAACA AACCCAACAT CAACAGCAAT CTTCAAATCA TTGATTTCAT 480 
TGAAGACACG TAACCCAATC GTCTTTGCCT TCCATCCATC AGCACAAGAA TCATCTGCTC 540 
ATGCAGCTCG TATCGTCCGC GATGCAGCTA TCGCAGCTGG TGCTCCTGAA AACTGTGTGC 600 
AATGGATTAC TCAACCATCT ATGGAAGCAA CAAGTGCCCT TATGAACCAC GAAGGTGTTG 660 
CGACAATCCT TGCAACAGGT GGTAATGCCA TGGTTAAGGC GGCTTATTCA TGTGGTAAAC 720 
CAGCTCTTGG GGTAGGTGCC GGAAACGTTC CAGCTTATGT TGAAAAATCA GCAAACATTC 780 
GTCAAGCAGC ACACGATATC GTCATGTCTA AATCATTTGA TAACGGTATG GTCTGTGCAT 840 
CTGAACAAGC AGTTATCATT GATAAAGAAA TTTACGATGA ATTTGTAGCA GAGTTCAAAT 900 
CTTACCACAC TTACTTTGTA AACAAAAAAG AAAAAGCTCT TCTTGAAGAG TTCTGCTTCG 960 

GCGTCAAAGC AAACAGCAAA AACTGTGCTG GTGCAAAATT GAACGCTGAC ATCGTTGGTA 1020 
AACCAGCAAC TTGGATTGCA GAACAAGCAG GATTTACAGT TCCAGAAGGA ACAAACATTC 1080 

TTGCTGCAGA ATGTAAAGAA GTTGGCGAAA ATGAGCCATT GACTCGTGAA AAATTGTCAC 1140 

CAGTTATTGC AGTTTTGAAA TCTGAAAGCC GTGAAGATGG TATTACTAAG GCTCGTCAAA 1200 

TGGTTGAATT TAACGGTCTT GGACACTCAG CAGCTATCCA CACAGCTGAC GAAGAATTGA 1260 

CTAAAGAATT TGGTAAAGCT GTTAAAGCTA TTCGTGTTAT CTGTAACTCA CCTTCTACTT 1320 

TTGGTGGTAT CGGGGACGTT TACAATGCCT TCTTGCCATC ATTGACACTT GGATGTGGTT 1380 

CTTACGGACG CAACTCAGTT GGGGATAACG TTAGTGCCAT TAACCTCTTG AATATCAAAA 1440 

AAGTCGGAAG ACGGAGAAAT AACATGCAAT GGATGAAACT TCCTTCAAAA ACATACTTTG 1500 

AACGTGATTC AATTCAATAC CTTCAAAAAT GTCGTGACGT TGAACGTGTC ATGATCGTTA 1560 

CTGACCATGC CATGGTAGAG CTTGGTTTCC TTGATCGTAT CATCGAACAA CTGGACCTTC 1620 

GTCGCAATAA GGTTGTTTAC CAAATCTTTG CGGATGTAGA ACCGGATCCA GATATCACAA 1680 

CTGTAAACCG TGGTACTGAG ATTATGCGTG CCTTCAAACC AGATACCATC ATCGCACTCG 1740 

GTGGTGGGTC TCCAATGGAT GCTGCCAAAG TAATGTGGCT CTTCTACGAG CAACCAGAAG 1800 

TGGACTTCCG TGACCTTGTC CAAAAATTCA TGGATATCCG TAAACGTGCC TTCAAGTTCC 1860 

CATTGCTTGG TAAGAAGACT AAATTCATCG CGATTCCAAC TACATCTGGT ACAGGATCTG 1920 

AAGTAACACC ATTTGCCGTT ATCTCTGATA AAGCAAACAA CCGTAAATAC CCAATCGCTG 1980 

ACTACTCATT GACACCAACT GTGGCAATCG TAGATCCTGC TTTGGTATTG ACAGTTCCAG 2040 

GATTTGTTGC TGCTGATACT GGTATGGACG TATTGACTCA CGCGACAGAA GCATACGTAT 2100 

CACAAATGGC TAGTGACTAC ACTGATGGTT TAGCACTTCA AGCCATTAAA TTGGTCTTTG 2160 

AAAATCTCGA AAGCTCAGTT AAGAATGCAG ACTTCCACTC ACGTGAGAAA ATGCATAACG 2220 

CTTCAACAAT CGCTGGTATG GCCTTTGCCA ATGCCTTCCT AGGTATTTCT CACTCAATGG 2280 

CCCATAAGAT TGGTGCGCAA TTCCACACAA TCCACGGTCG TACAAATGCT ATCTTGCTTC 2340 

CATACGTTAT CCGTTACAAC GGTACACGTC CAGCTAAGAC AGCAACATGG CCTAAGTACA 2400 

ACTACTACCG TGCAGATGAA AAATACCAAG ATATCGCACG CATGCTTGGA CTTCCAGCTT 2460 

CTACTCCAGA AGAAGGGGTT GAATCTTACG CAAAAGCTGT CTACGAACTC GGTGAACGTA 2520 

TTGGGATCCA AATGAATTTT AGAGACCAAG GAATTGACGA AAAAGAATGG AAAGAACATT 2580 

CTCGTAAATT AGCCTTCCTG GCTTATGAAG ACCAATGTTC ACCAGCTAAC CCACGTCTTC 2640 

CAATGGTAGA CCATATGCAA GAAATCATCG AAGATGCATA CTATGGCTAC AAAGAAAGAC 2700 

CAGGACGCCG TAAATAATTG TTTATCAGTC TAGAAGCAAG ACAAAAACTC AATTTGAGGG 2760 
AAAGATCCAG TAATTTTTCT ATGATAAAAfi AAVjtj 1 1 1 1 lb AACACCTGAT 2820 

AGGATGCCTT TTTATGATAT TGAGGnrTTT 1 luLLL 1111 1 uAAAAAL 1 A GAATAGAAAC 2880 

AAAATATATA ATAGATTGAA ACT Aft A ATfifl 1 ALA 1 A 1\. l\j LUL1 AAAAL ATTGTTAGAA 2940 

TTCGATTTGA CTGTCCTGAT CGATTTfiTPC lu 1 1L1 lnl 1 1 LA 1111 UA i ATATAAAAAA 3000 

TATAGTATAG TAGACTGAAT PTAAAATACJT ALuAAALAA I 1 vjt_ 1 AAAAL A TTTATAGAAA 3060 

TTAATTTTAC TTTTCTGATA \3r\\j 1 lull L. A LAI Li 1A1 1 1 LAATTCACTA TAGTTTAATT 3120 

TAAGAGTAGT ATTTACTAAG va v>LV_AM I InA LAAACTAGAA AACGAGTGCC 3180 

ATTCAGCTCA AAACACTGAT 1 luAuAl 1\jL AL*A 1 AALtAL 1 AGCCCCCTCA TTAACAGATT 3240 


TACGATAAAA CGATGACAAG GTGTGTTGCT TTTTGATTTC TAAAGAGTAT AATGATAGAT 3300 

CTCTATAAAA TAAGTGCGAA GGAAATGAGC TTTTATAGTC CTTTCGTTTT AAAATACTAT 3360 

CTCAGATATT CTTATATCGA CAAGAAGTTT TTGAGTCATT CCCTCATCAT ACATATTAAA 3420 

TAAATAGTGG CTCATTCAAT TTTTCACTAG AATAATAAGC TAGTATAGTA AACTGAAATA 3480 

AGATATAAAC AAATAAATTG GAGCTTAACA TCCATTTCCA GCAATTTTTT AGAAACTACA 3540 

GTGGACTATT CTAGATTCAA CATATTATAA AAACTAGAGT AAAAGAAAAG GATTGGATCT 3600 

TGTGTAATGC AGGATCCAAT CCTTTCAATC ATTTTGTCCA ACTTTTGGAG GTTCCTACAA 3660 

TGTAGTCGTC ATTAATAAAG ACAGATGGGA ATGACAGTGT TCCTATTTAT TTTGATAGAG 3720 

ATCGATGAAT TCTTTAGATA GCAACTGAAT AATCTCTGTT GAAGCCATTT GGTCTTCTGC 3780 

ATGCATAAAT AGCAAGGAGA ATCCTATTTT TTCTCCAGTA GCTTCTTTTT GTATGAGATT 3840 

AGAGTGAATC TTGTGCGCTT CTACTAAGGA GTCTTCCGCT TCTTCAACTT TAATTTTCGC 3900 

TTCTTTTAAA TTTCCTGCCT TAGCTAGTTG GATGGCTTCA ATAAAGGATG ATTTGGCTGC 3960 

TCCACTATTG GCAATGAGCT GAAAACAGAT ATATTCCATT TCTTCTGTCA TCTTATTTCT 4020 

CCTATCCATG CAAGTGCTTG TTCCAGAACT TTTGCTCCAT TCATCATTCC Ks i AA 1 L L LW_ 4080 

ATATCAATGG TATCTACAGG GATATTTCCT GCAATTTCTT TCACAGCAAG TAACTCATAA 4140 

CGAATTTGTG GCCCAATTAG AATGACATCT GCTTCATGGA TATTCTTTTT AGCTTCTGTC 4200 

ATTGATTTTG CTTGGATAGA GATTTCAATC CCACGTTCAG TCGCACTTTG TTGCATTTTT 4260 

TTAACAAGCA TACTTGTCGA CATTCCCGCA TTACATACTA ATAAAATTTG TTTCATAATC 4320 

TTAACCTTCC ATTTCTTGTT CAACAACTTT GTCATTAACT TTGATAAATG GAATGTATAG 4380 

AAGAACTCCA AGTGCAAAGA TGATGAATTG AACTAGAACT GCTCTCACGT CCCCTGCTGT 4440 

TGCTAACCAT GCATTTAAGA ATACTGGTGT AGTCCAAGGA ACTTGTATAA ATGCAGGACT 4500 

CATGAATTCT GTAACTGTTG CTAAGTAGCT GATTAAAATA CCAAGGACTG GAACTGTGAT 4560 
AAATGGAATA GCTAATGAAA TGTTATAAAC GATTGGGTAA CCGAATAATA CTGGTTCATT 4620 

GATATTGAAG ATACCAGGTC CAAAAGATAA TTTAGCCACG TTTTTAGAGA CAGCATTGCG 4680 

ACTCACTAAG AATGTTGCTA TTAATAAACA TAATGTAGAT CCACTACCAC CCATTAAAGC 4740 

GAATGTTTGT ATTTGTGATA GGTTGATGAT GTGTGGAATG GCTTGTCCAT TATTTGCTGC 4800 

AGTGATGTTT TCAGTAATGT TAATTAATAG TAATGGTTCT AGGATGGCAC TGTAAATAAC 4860 

TGCTTGGTGA ATACCAAATA GCCATAACAT ATTTCCTAAA GAGTAAATAA TAATGACCCC 4920 

GATTAAGCTT GTACCAATAT GACGAATTGG TTCTTGAATA AAGATTGTAA TGATTGAGAT 4980 

TAAGTTCATT CCAGTTATAT TGAATAATAA TGCTGAAACA ACCCCAAATA AGGAGATGAC 5040 

GGTCATGACT GGAAGTAATA CGCTAAATGA TCTACTAACA GCTGGTGGAA TATTTTCACC 5100 

AAGGTTCATT TGTAAAGCTT TAACGTTTGA TAATTCAATG AATAATTCTG TTGCAATAAT 5160 

CGtACGATAA CCCCGGCGAA CATTGCGCCT GTACCTGTGT TGTTGAATGA AAGAACACCT 5220 

GAAATGTTTA CCGCATCTTT TGCTCCGTCA GGAACTACAG AAACTGTATT TGGCATCATC 5280 

ACAATTAAAG AAACTAATGA TAGCATTGAT GCTGCTAACG GGTTTTCGAA ATCTCTGTTT 5340 

TTAGCTAAGA AATAACCAAC CATTACAGCA ATAATCATAC CTGAAATACT TAAAGTACCG 5400 

TTTGCAATTG TTATTCCCCA ATATTGGAAT CTTGTTAATG TATCCCCTTG GAAAATCCAC 5460 

TTAAATACCG TGTTGTTCAA AAGAACGATT AAACCTGCCA AAATATATAA TGGCATTACT 5520 

GTTACGAATG CATCTCTTAG GGTTTTTAAA TGAATTTGGT TCCCTAGTTT ACCAGCAAAG 5580 

GATGGCAAAA AAATTTTTTT GGGGGGGGGG GTTATTAAAC CCCCCTTTTT AAAAAAAA 5638 
(2) INFORMATION FOR SEQ ID NO: 218: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4745 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 218: 

CCGGAAGCTG TTGCCCTTGG AACTCCAAAT GAAGAAACAG CCTTTGTCTT GAACTATTTT 60 

GGTGTGGAAG CACCACGTGT TATCACTTCT GCCAAAGCAG AGGGGGCAGA GCAAGTTATC 120 

TTGACTGACC ACAATGAATT CCAACAATCT GTATCAGATA TCGCTGAAGT AGAAGTTTAC 180 

GGTGTTGTAG ACCACCACCG TGTGGCTAAC TTTGAAACTG CAAGCCCACT TTACATGCGT 240 

TTGGAGCCAG TTGGATCAGC GTCTTCAATC GTTTACCGTA TGTTCAAAGA ACATGGTGTA 300 
GCTGTGCCTA AAGAGATTGC AGGTTTGATG CTTTCAGGTT TGATTTCAGA TACCCTTCTT 360 

TTGAAATCAC CAACAACACA CCCAACAGAT AAAATCATTG CTCCTGAATT GGCTGAATTG 420 

GCTGGTGTGA ACTTGGAAGA ATATGGTTTG GCAATGTTGA AAGCTGGTAC CAACTTGGCT 480 

AGCAAATCTG CTGAAGAATT GATTGATATC GATGCTAAGA CTTTTGAACT CAACGGAAAT 540 

AATGTCCGTG TTGCCCAAGT GAACACAGTT GACATCGCTG AAGTTTTGGA ACGCCAAGCA 600 

GAAATTGAAG CTGCAATGCA AGCTGCCAAC GAATCAAACG GCTACTCTGA CTTTGTCTTG 660 

ATGATTACAG ATATCGTCAA CTCAAACTCA GAAATCTTGG CTCTTGGTGC CAATATGGAC 720 

AAGGTCGAAG CGGCTTTCAA CTTCAAACTT GAAAACAATC ATGCCTTCCT TGCTGGTGCC 780 

GTTTCACGTA AGAAACAAGT GGTACCTCAA TTGACTGAAA GCTTTAATGC GTAAGATTTT 840 

GGGTGTCAGC TCAAAATCGG AAAGTCTAGT TTGCCTTATA TCGCAAGGAG TTTCGGCTCC 900 

TTTTTTCTAG GAGTGAAGTA TGTTAGAAAA TGGCGATTTG ATTTTTGTGA GAGATGGGTC 960 

AGACATGGGA CAGGCCATCC AGACTTCCAC AGGTAACTAT AGCCATGTTG CCATTTATTT 1020 

GGATGGGATG ATTTATCATG CTAGTGGACA GGCTGGTGTT GTCTGTCAAG AACCGGCAGA 1080 

CTTCTTTGAG TCCAATCATT TATACGACCT CTATGTTTAC CCAGAAATGG ATATCCAGTC 1140 

GGTGAAGGAA AGAGCTTGCA AACATCTTGG AGCACCCTAC AATGCTTCTT TCTATCCAGA 1200 

TGCAGCTGGT TTTTACTGCT CCCAGTATAT AGCAGAAATC CTACCTATTT TTGAAACTAT 1260 

TCCTATGAAA TTTGGAGwTG GGGAGCAGGA GATTAGTGAT TTTTGGAGGG AGTATTACAT 1320 

AGAACTAGGT CTGCCTGTTC CTCTGAACCA AGCTGGTACC AATCCTAGTC AGTTGGCAGC 1380 

ATCGCCTCTG TTACAATGTA AAGAAAGGAA TCTTCATGAT TCAGATTTTT AATCCATCTC 1440 

GTTTGACGAG ACAGCCATTT TTGGAGAATT GATCCGCTAT CTGGATCAGT ATGAGGATGT 1500 

GATTCTACGG GAAATTAAGG CTCAATTTCC AGATGTTGCA GTTGATAAAC TCATGGAAGA 1560 

GTATATAAAG GCAGGCTTGA TTCTACGTGA AAATAAGCGC TATTACCTCA ATTTTCCTAC 1620 

GCTTGAATCA CTTGATAGTC TTGAACTGGA TCAAGAGATT TTTGTCAGAG AAGCTAGTCC 1680 

GGTCTATCAA GCCTTGTTGG AGCAGAGTTT TGAGACGGAA TTGCGCAATC AAATCAATGC 1740 

AGCTATTTTA GTTGAAAAGA CGGACTTTGC GCGCATTAAA ATGACCCTGT CCAATTATTT 1800 

TTACAAGGTC AAACAGCAGT ATCCTTTGAC AGAAAAACAG CAGGAGCTCT ATGACATTTT 1860 

AGGAGATGTT AATCCTGAGT ATGCCCTCAA GTATATGACG GCTTTTTTGT TGAAATTTCT 1920 

CAAAAAAGAC CAGCTTATGC AGAAATGCCG TGATATCTTT GTGGACAGTT AGGTTGTCTT 1980 

AGGCTATATT GTGCAAAATG AAGATGGAAA GTATGAGTTG GCTATCGATT TTGATAAGGA 2040 

GAGGTTAACT TTCTACTTAG CGTGATTTCT TGTTTCTGAG TACATTGTTT GACTTTCCTT 2100 
AGTATTCGGT ATAAACTATA TGTAACCGGT AACACATATC GGAATAAACT AAAGGAGACA 2160 

ATCATATGTC ACTTGAAAAC AAATTGGAAC AAGCAACAGG CGCTGTCAAA GAAGGTTTTG 2220 

GTAAAGTTAC TGGAGACAGC AAGACAGAAC TTGAAGGAGC TGTTGAAAAA ACAGTTGCTA 2280 

AGGCAAAAGA CGTTGTAGAA GACGCAAAAG GTGCTGTAGA AGGTGCCGTT GAAGGTTTGA 2340 

AAAACGTTTT TACTAAAGAA TAGGAAAAAA TCAAGGGTTT CATTTTCCCT TGATTTTTTC 2400 

TATTCTTATA AATAATTTTC TGCGACGGCT GTATCTCCTG GGTAGGATTC TTTCTTGCCC 2460 

TGGATGATTT GGTAACAATC GGCTCCCTTA CCCGCAATAA TAACTGCATC TAATTCGTGA 2520 

TTTGTGATAG CCATTGCCGC CTTGATGGCT TCTTGGCGAT CCGCAATCTT TTCAACAGGA 2580 

TGATTGATGT AGCTACTAAT TTCATCTGCA ATGGCCATTG GGTCTTCATA GTTAGGGTCA 2640 

TCAGCAGTCA GAAAGACTTG AATCTCAGGG TGTTGATTGA GGAGGAGGCC AAAGTCCTTA 2700 

CGACGACTTT CTCCCTTGTT TCCTGTTGAT CCCAGAACCA GAGCAATCTT TCCGGTTTGA 2760 

TGAGTTTCAA CCACATTGAT GAGTTTTTTC AGACTATCCC CATTGTGGGC ATAGTCGATG 2820 

AAGACCTTGG CTCCATTTTT CTGAGTGAGG ACTTCCATAC GACCAGGAAC GCGGGTTGCA 2880 

GCGATGCCTT TTTTGATGTC CTCAAGACTT GCTCCGAGAC GGAGACAAGC AAGTCCAGCA 2940 

GCAACTGCAT TTTCTTGGTT GAAGTTGCCA ATGAGTTGAA TATCATAATC TCCAGCGAGT 3000 

TTACCCGTAG CTGAAAAGCT AAAGGCTTTG GAATTCTCGA TTTGGTTATC AAATTGGCTA 3060 

CCATAGAAAT CATGGTCTTG ATCTTCAACC TGTTCTTTCA AGACTGAGAA GTGGTCCATG 3120 

TCACTGTTAA TGATGACTGC TCGGCTCTTT TCCATCAAGA GACGCTTGTG GTAGAAATAG 3180 

TCTTCAAAGC TAGGGTGTTC AATCGGGCCG ATATGGTCTG GGCTGATATT TAGGAAAACT 3240 

CCCACATCAA AGGTTAGACC ATAGACACGT TTGACCAGAT AGGCTTGACT GGAGACTTCC 3300 

ATGATGAGGT GGGTACGGTC ATTTTGCACA GCCTGATTCA TCATGTCAAA GAGGTCAATA 3360 

CTCTCAGGGG TTGTCAACGC TGACTTAAAG AAAGTCTCGC CATCAAGAGT TGTGTTCATG 3420 

GTCGACAACA TAGCAGGTCT ATGCCCTTGA GATAAGATGT TATAGGCGAA ATAGGCTGCT 3480 

GTTGTCTTAC CCTTAGTACC AGTAAAGGCA AGGAGTTTGA GTTTTTCCTG TGGATTACCA 3540 

TAGAACTCCA TGGCAATCAA ACTCATGGCT TTCTTTATAT CGTTCACAAT GATGACAGGG 3600 

ATACCGACTT CGTAGTCCTT TTCAGCTACA TACCAAGCTA ATCCTTGTGT TATAGCAGAA 3660 

AGAAGGTATT CTTTTTTAAA GGCAGCGCCT TTTGCGAAAA AAAGAGTGTC TTCTGTTACT 3720 

TTTCGGCTGT CGTAGCTGAT GCTATCAAAA ATAACTTTGC TGTAGTTGTA GTGGTAATGA 3780 

CCTTGGTCAA TAATTTCGCG AAAAAGGCCA TCTTTCTTTA AAATATCTAA TACGGTTTCA 3840 
ATCTTAATCA TACTTTCTAT TGTAAACCGA AAGTCGTAAA TTTACAAGTA ACAAGGAAAA 3900 

GTTTATAATG GAAGATAAGG AGTTTTTCCT AGTTATCAAA ATTGAATGAG GAATCTATGT 3960 

CGCACGAAAA CAATCACCAG CAGGCCCAGA TGTTACGGGG GACTGCTTGG CTAACGGCTA 4020 

GTAACTTTAT CAGTCGCCTA CTCGGGGCTG TTTACATTAT CCCTTGGTAC ATCTGGATGG 4080 

GGGCTTATGC AGCTAAGGCA AATGGTCTCT TTACCATGGG TTACAATATC TATGCTTGGT 4140 

TCTTGTTGGT TTCAACAGCG GGGATTCCAG TTGCGGTGGC CAAGCAAGTT GCCAAGTATA 4200 

ATACCATGCG AGAAGAAGAG CATAGCTTTG CCCTGATTCG GAGCTTCTTA GGCTTTATGA 4260 

CAGGACTAGG CCTGGTTTTT GCTTTAGTCT TGTATGTCTT TGCTCCTTGG CTAGCAGACT 4320 

TGTCTGGCGT GGGCAAAGAC TTGATCCCAA TCATGCAAAG CTTGGCTTGG GGAGTCTTGA 4380 

TTTTCCCGTC TATGAGTGTT ATCCGAGGAT TTTTCCAAGG GATGAATAAC CTCAAACCCT 4440 

ATGCCATGAG CCAAATTGCT GAGCAGGTCA TTCGTGTTAT CTGGATGCTC CTAGCAACCT 4500 

TTATCATTAT GAAGCTCGGT TCAGGAGATT ATCTAGCAGC CGTTACCCAA TCAACCTTTG 4560 

CTGCCTTTGT CGGTATGGTA GCCAGTTTTG CAGTCTTGAT TTATTTCCTT GCCCAAGAAG 4620 

GTTCACTCAA AAGAATCTTT GAAACAGGAG ATAAGATTAA CAGTAAGCGT CTCTTGGTTG 4680 

ATACCATTAA GGAAGCCATT CCTTTTATCC TGACAGGGTC TGCCATCCAG CTCTTCCAGA 4740 

TTTTG 4745 
(2) INFORMATION FOR SEQ ID NO: 219: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1900 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 219:

CCTGATTGAC CTTATAATAA GGAACAAAAC ACAATGCACT ACCTTTTCAA CAAAAGAGTT 60 

GCTGCTTGAT TAAAACCATC ACACCAGTTA TACCATTTTG CTTCATACCC ATCTTGAGCT 120 

AGGATACGAT CTTCTAAATC AAAAACAGAG TAAATCTTTC TTTCCTCGCA AGCTTGCGCA 180 

TAGAGATGAT ATAGTTCATC ACCACCATCT CTATCCCACT CAGCAGAAAT CGTATCCCGA 240 

CCTGCCAATA AAGCCTGATA AGCCCTGTGA TGCCCATCTG TAATCAGCAA ACAATCTCCA 300 

AAGGCAAGAA TACTGATTGG ATCGACTTGG ATTGTTTCTG CCGACTGGTA AAGCATCTGA 360 

ATATCTTGCA ACTTCTTTTC TGATAAATAT AGTTGAGTCA GATGAAGATC TGCTATATTG 420 

ACTTTCATTT CTTTCTCCTC AAGGGAATTC GATACTCACT TCTGTTTGCC TTTAAATCGC 480

CATTGGAAGC GGAgCTTGTC ATAAAAGGGA AACTCGATAA ACAGGACTCC CAAGCCCACA 540

CAGAGACTGG CAAGGACGTC TGATGGGTAA TGAACTCCCA GATAGACTCT TGATACCAGC 600

ACACTGACTA GGTAGAGGCC AAGGACGATT TGTACGATTT TTCTCCAGAC CTGATCTTTA 660

ATCCGCTGAC TAAGAATAAC AATCAAAGTC CCTACCATCA GCGTTACAGC TAGAGAATGC 720

CCACTTGGGA AGGAAAATCC CTTCTCCTCC ACCAGATGTA AAATAGCTGG TCGTGGGCGC 780

TGGTAGATAT TTTTAAAGGT CACGATTAAA AGACCTGCCA AAGCCAGATT TCCCAGCATG 840

AAGAAACTTT CTATCTTCCA TCGCTTACGA TAAAAGACAA AAGCTGTAAT GACAACCCAA 900

GTGATAATCA CTGGGATATC AATCAGACGT GTGAGGGCTC GAAAAAGAAT AGTCAAATAA 960

TCTGGTAAGT CTCCTCGAAT GGCAGTCTGA ATCGATTGGT CAAAATTGAC CAACATTTCA 1020

GGGTAAAATT TGACCATGTA GCCAAGAATA ACGAAAAGTA AAAGGGCAAA ACTGCCCTTC 1080

ATTAAAAATG TTTGTTTATC TCTCATAATG TTTTAAGGTT GGTTTCAAGA GAACATACAA 1140

CAACCAGAAT GAAACGGAAA AGATAACACC TTCAATCAAG TTAAAAGGTA ATACCATGGT 1200

CATTAGGTAG TTGGAAAGTC CCAAAATTTT TCCAATATCA AAGTTAGCAA ACTTAGCGTA 1260

CAAAGGAACA GCATAAACAT AGTTGAGAAC CAACATGGCC AAGGTTAAAC CAATAGTTCC 1320

AGCTAGAGAG CCTAGTAGGA AACGAAGGGT TGTCCGTTCC TTTTTCCAAA TCAAAGCAAA 1380

TACGATGACA AAAACTCCCA AAGCTACGAT ATTCATCGGC AAACCAATGT AAGTATTCAC 1440

TCCTTGGCTG TTAAGAAGCA ATTTCAAGAG TGAGCGAAGC AAGAGCACTC CTAGAGmCsC 1500

AGGCAAATCC ATGACCACCA GACCCACAAG GACTGGCAAG ATACTAAATT CGATCTTGAG 1560

GAAAGATGCC GCTGGTAAAA GCGGAAAGTC AAAGTACATC AGCACAAATG AGATGGCTGA 1620

TAGAATTGCA ATGGTCGAAA GTCGACGTGT GTTTGTCATA ACAGGTTCCT CCAATTTTCT 1680

ATAAAATCAG AAGAAGTTGG AAAGGATTCC TCTATCTATT CTCACTTTTT ATATCCCAAA 1740

AGTTCCCTCT TACTCTATTA AAGAAAAACA AAGCAAGTGG TTACAATCCG GCTATAAATC 1800

TATCAAAACA GACAAGGCTA TTCTTTCGTC TTCTCCCATC CAGACTATAC TGTCGGTTGT 1860

GGAATCTCAC CACATCACGT TGCGCTCACG GACTTCTTTA 1900


(2) INFORMATION FOR SEQ ID NO: 220: 







(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4692 base pairs 

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 220:

GGTTTTCCAG CAGGAGCTTC TCCTTTATCA GAATGACCAT CCCATCTGCT CACGATAGAT 60

GAATAATGAT ATTTTTTACC ATGATAGTAA TTTGAAAAAG CCTAACCACC TCCTGAACCT 120

TCTCCATATG TCCATACTCC TCCATCTGGA TATTATACAG CAGCTGATGC AGCTCCCAAT 180

AATGTAAAAC TTGAAATAAG AGCTAGAGCA AGTAATCTAT GTTTTTTCGT TTTCATTTTA 240

TTTTTTCTTT CAAAAAAAGC ACACCTTGAG CAACAATGCA ACAAAATAAA TCCTCCTCTC 300

TCTTTTATTG AAACCGCTTT CTTATGTGAT AAGAATAACT TTTTTATtAT TTGTTGTCAA 360

GGAAAAAATC GAATTTTTTA GATATTTTAC TATATTACCT CTGTGAATAA TATTATATAG 420

TAGTTTTATT TCAAAATAAT ATGCAACCAG TACTAACCAA ATATAAAATA GATGCCATTA 480

ACGAATTTTA TTCAAGTTTT TCCCATTCAT ACTATACAAG TAAAAGAGAT GGTGTTAACT 540

AAAAAGCAAT TCAAACTATT GTAAAATTCC TAGCAAAAAG AGAGCCGAAA CTCTCTTTTT 600

TATCTTCTTT TACTTTTTTT GACTGGCATG AGTGTGATGT CTCTAACACT AAAGTAAGCT 660

AGGATCAACA TGGCTATTGC TAGGAATATT TCTGTTGGTA ATTGAAAAAT TTTCAGAAAA 720

GATAGAACCA ATAAAATCAA GAGTGCCACT AAAATACATA CCATAGCGAC GATATTGACA 780

GTCCCTTTAA TGCTTTCTGG TGTCGCAAAT ACATAGAGTA GGAGCAGTAA AATTCCTAGG 840

ACTAAATAGA CCATCTTTCT CTCTTTCTAG CTCTTATTCA GCTGATTTTT TCTTCTTGTT 900

AGCTTTCTCA CGCTCTGCTT TGTTAAGGAT TTGTTTACGC AAACGGATAG ACTCAGGCGT 960

TACTTCCATG TACTCATCGT CGTTCAAGAA CTCAAGAGAC TCTTCAAGTG TCAAGATACG 1020

AGGCGTCTTG ATAACAGCTG TTTGGTCCTT AGTAGCTGAA CGAACGTTGG TCATTTGTTT 1080

TGCCTTCGTG ATGTTAACTG TCAAGTCATT TTCACGAGAG TTTTCACCGA TGATCATTCC 1140

TTCATAAACC TCAGTACCTG GGTTGACAAA GATCGTACCA CGTTCTTCGA TAGACATGAT 1200

TGAGTAAGTT GTAGCCTTAC CAGCATCGAT AGAAACAAGG GCACCACGGT GACGTCCACC 1260

AATTTCCCCT GGAATCAATG GCAAGTATTG GTCGAAGGTA TGGTTCATGA TACCGTAACC 1320

ACGAGTCATT GATAAGAACT CAGTTGAGTA TCCAATCAAA CCACGCGCTG GAACAAGGAA 1380

GACCAAACGA GTTTGACCAT TACCAGTTGA AATCATATCC AACATTTCAC CTTTACGTTC 1440

AGAAAGGCTT TGGATAACAG ACCCTTGGTA TTCTTCTGGA GTGTCGATTT GTACACGTTC 1500

AAATGGTTCA CATTTAATAC CGTCGATTTC TTTTACGATA ACTTCTGGAC GAGATACTTG 1560

AAGTTCATAG CCCTCACGAC GCATTGTTTC GATAAGGATT GACAAGTGCA ATTCTCCACG 1620

TCCTGAAACA GTCCATTTAT CTGGTGAATC AGTTGGGTCA ACACGAAGGG AAACGTCTGT 1680

TTGCAATTCT GCCTGCAAGC GTTCTTCCAC CTTACGAGAA GTTACCCATT TACCTTCTTT 1740

ACCAGCAAAT GGTGAGTTGT TGACCAAGAA AGTCATTTGA AGAGTTGGCT CATCGATGTG 1800

TAGGATTGGA AGAGCTTCTA CTGCATCTGT CGGAGTGATG GTTTCACCGA CAAAGATGTC 1860

TTCCATACCT GAAACGGCAA TCAAGTCACC CGCTTTGGCT TCTTGGATTT CACGACGTTC 1920

CAAACCAAAG AAACCGAAGA GTTTTGTAAC ACGGAAGTTT TTAGTTGTAC CGTCAAGTTT 1980

AGAAAGGGTA ACTTGGTCCC CAACCTTAAC TGTACCACGG AAGACACGAC CGATACCGAT 2040

ACGTCCAACG AAGTCATTGT AGTCCAAAAG TGACACTTGG AACTGCAAAG GCTCATCTGA 2100

GTTATCTACT GGAGCTGGGA TATGGTCGAT AATCGTGTCA AAGATTGGTG CCATAGTCGC 2160

TTCTTGGTCA GCTGGATCAT CTGACAATGA AGAAGTTCCG TTGATCGCTG AAGCATAAAC 2220

CACTGGGAAA TCAAGCTGGT CGTCATCTGC ACCAAGCTCG ATGAAAAGTT CCAAGACTTC 2280

ATCCACTACT TCTGCTGGAC GAGCTGATGG CTTATCGATT TTGTTAACAA CCACGATTGG 2340

GACAAGGTCT TGTTCCAAGG CTTTTTTCAA TACGAAACGA GTTTGTGGCA TGGTTCCTTC 2400

ATAGGCATCT ACGACCAAGA CAACACCGTC AACCATTTTC ATGATACGCT CAACTTCTCC 2460

ACCAAAGTCC GCGTGTCCTG GTGTGTCCAT AATGTTGATA CGAGTTCCGT TGTAAGCAAC 2520

GGCAGTATTT TTAGCAAGGA TGGTAATTCC ACGCTCTTTT TCGATATCGT TTGAGTCCAT 2580

AGCACGCTCT GCCAATTCAG TCCGTGCATC AAGCGTTTCT GATTGTTTCA ATAATTCGTC 2640

AACCAGGGTT GTTTTACCGT GGTCAACGTG GGCGATAATC GCAATGTTAC GGATATCTTC 2700

TCTTAATTTT GTCATGATTT CCTCTATAAT ATTCAAAATT TATTTTCTAA CTGAACGATT 2760

ATACCATAAT TTCAAATAAA TAACATAACT CAAGCAAGTG TAAATGTTTT CACTCTGCTT 2820

TTCTTTTCAC GTCAAGCCTT TTCAAAGCGA GCGACTTATG ATAAGATAGG CACAGTATGC 2880

GTTTAGATAA TTTATTAGCT CAAGAAAAAA TCAGCCGAAA GGCCATGAAG CAAGCACTCC 2940

TCAGAGGGGA AATTCTAGTC GATGGTTGCC CAGCCCGCTC CCTAGCTCAA AATATCGATA 3000

CAGGACTACA AGAACTCCTT TTTCAGGATC GAATCATTCA AGGCTATGAA CACACCTATC 3060

TTATGCTTCA TAAACCTGCT GGTGCCGTTA CAGCCAACAA AGACAAGGAA CTTCCGACCG 3120

TCATGGACCT GCTTCCATCT AACATCCAGT CTGACAAGCT CTATGCCGTT GGCCGACTGG 3180

ACCGAGATAC AACGGGACTC CTCCTCTTGA CCGATAACGG TCCCTTGGGC TTTCAGCTCC 3240

TCCATCCCCA ATATCATGTC GATAAGACTT ACCAAGTTGA GGTTAATGGA CTTCTAACAC 3300

CTGACCATAT CCAAACCTTT CAAAAAGGAA TTGTCTTTTT AGATGACACT GTCTGTAAAC 3360

CCGCAAAACT AGAGATTCTA TCTGCAAGTC sCTCCCTCAG TCAAGCCTCT ATCACCATTT 3420

CAGAAGGAAA ATTTCATCAA ATCAAGAAAA TGTTCCTCTC GGTTGGTGTT AAGGTGACTA 3480
GCCTCAAAAG AATCCAATTT GGGGACTTCA CATTOAACCC AGATTTAGCA GAAGGTAACT 3540

ACCGCCCTTT GAACCAAAAA GAGTTACAAA TCATTAAAAA CTATTTAGAG ATGAGTCGAT 3600

AAAACAAAAA AAGCTTTAAA ACTAAAGCTT TTTTCTTTTA TTTACCGAAA AATTAAGGCG 3660

ATTGCTACAA TCCAGTTAAC TACAGAAATC ACAATTCCTA AGATATTAAG AATCTTTTCT 3720

ATTTTATAGT CTAATTGTGA CTCTTTTTGG TATGAAATAG CCAAGACCAA TCCTATGATA 3780

CCCAAAATCA GGCCTACAAT TGGAAATAAC AAACCAAGAA TAATCGACAA GATACCCACA 3840

AAAAGTGGAT TTTTCTTCTT TTCTTTTATG TTCTAAGAAC TCCTTAAATT TTATACAAAT 3900

TAATTATACT ATAAAACAAT AGCTTCATCC TATCATTCGA CTAATTTGGA AATAAGGTTA 3960

GCTAGTCTTC ACTTTCCCTT TCCAAGAATC CAAGCCATAA GAAAGGATAT AAATCTCAGA 4020

AAAACCTTGT TTTTTCAAGT AAAGAGCTGC ATTTGTAACT CGTTGCGCAC GTTGGTTTTC 4080

GTAGAGAAGG ACAGGTTTAT CTTTACGAAG GGCTGCAAGA CTAGTTTTCA ACTGACTTGA 4140

AGGAATATTG CGTGCACCAA GGATATGTTT TCTGTGGAAT TCTGCTGGGT CGCGCAAATC 4200

AATCAATTGA CCCGTACGAA TCAAGGCTTC AAACTCCTCA TTGTCCACAA TTTTAGCCGC 4260

ACGGCGAATA CGAAGATAGT TAAAGCCCAT CCACGCCAAC ATTGCTAGTA TAAGTGCCCA 4320

CAAAATCCAA GTAACCATTA GTTCTTTTCT CCATTTTTCT CAATATAATC CAATTCTACC 4380

TTGTGCTCTC TGCGAAGAAC TGCTTCTGCC TCTAGATAGT CTAATTTATC CATCAACCCT 4440

GCATCGTAAA TCCGAGATAG TTCCAACTTC ATCAGTTCAA TATCATATAA GCGTTTTCCC 4500

ATGTAAACAA TAATACCAAA TCGTTTGAGG AATTGCTGCA CATCATAGAA TGTTTTCATA 4560

AGACTCATTC TAGCAAAATT TTGTGTTTTT TTCAAGAAGA GACTCACACA ATGCTCCTTA 4620

TTTTCCTATC TTCTTTAGCG ATTCTAAGGC AAGTATCGTA CAATAAAAAC ATGGGGATTC 4680

AACAATTACA TT 4692

(2) INFORMATION FOR SEQ ID NO: 221:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 706 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 221:

GCTAAAAAGC TGATAATCTT CGACTCCTGT ATATGATGTG TCTTTTCATG TAAGACACGC 60

GCCGCCAGAA TCATGGCAAG AGCTGCAAGA CTGGCAAGTA AGAAGCCGAT AAGATAGGCA 120

AAAAGATAAG TGAATTTGAC AAAGAAAGTC AAAAGAACTA GGAAACCAAA GCCTCCTCCA 180

AAAACTACCA AAGTCTTTCG TAAATCCCAG ATTTTATCCA ACTGCTTGAC GAGGGAAGTC 240

GTCTGACGAA CGCCTACAAT AGTTGCTAAC ATACTTCCTA AAAAGAATGG ATAGACATGA 300 

GTTAAACTGG AGAAATAAAC AGAGGAATAA GAGGTCACTA GAAAACTACC AATAAACATG 360 

GAGAAGAAAC TGATCAAGAA GGCAACAGCA GATAAGAGAA AGACCATCCC CTTCAACTGA 420 

CCATTTGATT TAGCTTGTTT GGATAAGAAC CAAACTGCCA ATCCCCAAAG AATATAGTAG 480 

TGAACCTCAA CTGCCAAACT CCAATTATGA ACAAACAAAT GAGGAATGAA CTGAGATTCA 540 

TAACTCCCAC CTGTTAGGAG TTCATAGAAG TTGGTCATAA AGCCTAAGAC GCCCGCAATC 600 

TGGCCACCAA TTCCAGCAAC ATAGTCTTGG CGAACCAAGA AAGTAAAAGG CATGGTCACC 660 

AAGACCATCA AAACCACAGG TGGCACAATC TCGATAAAAG CGTCTT 706
(2) INFORMATION FOR SEQ ID NO: 222: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3236 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 222: 
CAGCTGATGG GCAATATCAG TCATAGAAAT TTTTTCAATT AACTTTTGAG CAATTTTTTG 60

GTTGATGATA CGAGGGATTT GGTGATTTTT CTTTACCAGG GGAGTCTCAG CAACCATCAT 120

TTTTGAACAG TGATAGCACT TGAAACGGCG TTTTCTAAGG AGAATTCTAG AAGGCATACC 180

AGTTGTTTCG AGGTAAGGGA TCTTAGACGG TTTTTGAAAG TCATATTTCT TCATTAGACT 240

TCCACAATCA GGGCAAGATG GAGCCTCATA ATCCAGCTTA GCGATAATTT CTTTGTGGGT 300

ATCCATATTG ATGATATCTA GAATCTTGAT GTTTGGGTCT TTAATATCGA GCAGTTTTGT 360

GATAAAATGT AATTGTTCCA TATGATTCTT TCTAATGAGT TGTTTTGTCG CTTTTCATTA 420

TAGGTCATAT GGGACTTTTT TTCTACACAA AAATAAGCTC CATAATATCC ATAGGGGATT 480



TACCCACTAC AAATATTATA GAGCCCGAAA ATATGGGAAA ACTGATCCTT GTTTCTGCTT 540 

TTGTCTATAG AAGAATAATA AAGATTATCT TCTTCAAATT CTCCGATATT CTCTAAAGTT 600 

TTGTGCAAGT TGCACAGAAC TTGTTTATTT TTTTGGTCAT CTTGCCATAG AAATATAAAG 660 

CGTTTTCATA TATAATATAA TTATCAAAAG ACAAAAGGAG TTCACCTCAT GGTAGAATTG 720 

AATCTTAAAA ATATTTACAA AAAATATCCA AACAGCGAAC ACTATTCAGT TGAAGATTTC 780 

AACTTGAACA TCAAAGATAA AGAATTTATC GTTTTCGTAG GACCTTCAGG ATGTGGTAAA 840 



TCAACTACAC TCCGTATGAT TGCTGGTCTT GAAGACATTA CAGAAGGTAC [illegible] 900

GATGGCGTAG TTGTCAACGA CGTAGCTCCA AAAGACCGTG ATATCGCCAT [illegible] 960

AACTACGCTC TTTACCCACA CATGACTGTT TATGACAACA TGGCTTTCGG [illegible] 1020

CGTAAATACA GCAAAGAAGA CATTAACAAA CGTGTTCAAG AAGCAGCTGA [illegible] 1080

TTGAAAGAAT TCTTGGAACG TAAACCAGCT GACCTTTCAG GTGGTCAACG [illegible] 1140

GCCATGGGGC GTGCGATTGT CCGTGATGCG AAAGTATTCT TGATGGACGA [illegible] 1200

AACTTGGATG CCAAACTTCG TGTATCAATG CGTGCTGAAA TCGCTAAAAT [illegible] 1260

ATCGGAGCTA CAACTATCTA TGTAACTCAC GACCAAACAG AAGCGATGAC [illegible] 1320

CGTATCGTTA TTATGTCAGC TACTAAGAAC CCTGCTGGTA CAGGTACTAT [illegible] 1380

GAACAAATCG GTACTCCTCA AGAAGTTTAC AAAAATCCAG TTAACAAATT [illegible] 1440

TTCATCGGAA GCCCAGCTAT GAACTTCATC ACCGTGAAAT TGGTTGGTAG [illegible] 1500

TCTGACGGTT TCCGTTTGAA AGTGCCAGAA GGAGCATTGA AAGTTCTTCG [illegible] 1560

TACGAAGGAA AAGAATTGAT CTTTGGTATC CGTCCAGAAG ACGTGAATGC [illegible] 1620

TTCCTTGAAA CATTCCCAGA CTGTGTTGTA AAAGCGACTA TCTCTGTATC [illegible] 1680

GGTTCAGAAT CTCACCTTTA CTGTCAAGTT GGTAAAGACG AGTTTGTTGC [illegible] 1740


GCTCGTGACT ACTTGCAAAC AGGTGCAACA GTTGAGCTTG GATTTGACTT GAACAAAGCA 1800

CACTTCTTCG ATGTAGAAAC TGAAAAAACA ATCTACTAAA ATAAATAAAA TTCAAAGCAC 1860

TACAAGAAAA GATATCTCTT TATCAATTGT AGTGGAGAGA TATCAGTTAA TCTAGGGAGA 1920

GAAACAAAAT GCTTCTCTCC TTTTTGCTAG AGAAGTCATA TTATGCATCT ATATTGTGAT 1980

GCTCTTTAAT ACTCTTCGAA AATCTCTTCA AACCACGTCA ACGTCGCCTT GCCGTACGTA 2040

TGATTACTGA TTTCGTCAGT TTTATCTGCA ACCTCAAAGA TGTACTTTGA GCAGCTTACG 2100

GCTAGTTTCC TAGTTTGCTC TTTGATTTCC ATTGAGTATT ATTTGTGGGT ACCATCTACA 2160

AGTGAAGCTA TATGCGTAAA CTACGTGAGC AATTGAATTC GAACTAGAGA GGTAATAATA 2220

AATTTATGCT ATAGTTATGG TGACTTGTAT GCTTTTGATT CTAGTTTATC AAATAATAGA 2280

TTAGAATTGT CAGATAATAT CATTTTGTGT TATAATGAAG AAAAAACAGA GGTGTTCAAA 2340

TGTCAGAAGC AGGTCATAAG TTTTTAGCAA AATTGGGGAA AAAACGCTTA CGTCCAGGTG 2400

GAAAGCGTGC CACAGATTGG TTAATTGCAG AAGGAGGATT TTCAAAAGAA AAGAGAATAC 2460

TAGAGGTTGC GTGTAATAGG GGAACTACAG CAATTGAGTT GGCACAGCGT TTTGGTTGCA 2520

AGATAACTGC TGTTGATATG GATGCTCAAG CTTTAGAAGT GGCTAAAAAA TCTGCTGGAA 2580

CGGCAGGTGT TGCTCATTTA ATCAGTTTTG AAAGAGCAAA TGCAATGAAA CTTCCTTATC 2640

AAGATGCTAG TTTTGATATT GTTATAAATG AAGCTATGCT GACTATGCAA GCCGATCAAG 2700

CTAAGAAAAA ATGTGTAATG GAATATCTAA GGGTATTAAA ACCTGGAGGT CTTCTCTTGA 2760

CACATGATGT GCTTCTTAAG GAAGCTAAAG AGTCTATCAG ACAGGAATTA TCACAAGCAA 2820

TTCATGTAAA TGTAGGTCCT TTAACTCAAG ATGGTTGGGA ACAGGTGATG ATAGAATCAG 2880

GTTATTGTGA TGTGAAAGCA TTGACTGGTG AAATGACATT AATGAAATTA TCGGGTATGA 2940

TTTATGACGA AGGTTTGCTA GGAACTTTGA AAATTTGTGT AAATGCTTGT AAAAAGGAGA 3000

ATAGAAAGCA GTTTTTAACT ATGTATAAAA TGTTTGCTAA GAATAAACAG AAATTGGGCT 3060

TTATTGCGAT GGCTAGTTAT AAATCGTCAA AACGTTAGAT AATTATTGAA GTTAACTTTT 3120

CCTTTTTTCT TTCTTAAAAA ATATGCTATA ATAGAGAGTA AAAAACTTTG AAAGAAAGAA 3180

AAAGATGAAT TTAAAAGATT ACATTGCAAC AATTGAAAAT TATCCAAAGG GTACCG 3236


(2) INFORMATION FOR SEQ ID NO: 223: 









(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2885 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 223: 



CCTGACTTTT CAAATTGGTT AGTTTGCCAC ACTTGGTTTA TATGGTCGTG GAAAGCATGG 60

CTATTACTTC TCAAAGGGCG ATTTCTCACC CCATGAAAAG TGTCTATTTT TGTTTAGGTT 120

TGTAAGTTAA TTCATTGTCA CATATTACTC TTTAACTGAT TGAGTGAGTA CCGCTTATAT 180

TTGATGCCAA ACGCCTTAAA AGTGTTACCC TCAAGTCCTT TTAGAATACG GCTATAATTC 240

CGCTCATTGT AAACTATCTT AAGCTCATCA CTATCTAGGT TGGTATTAAA AATGGTATTT 300

TCACGATTGT TTAGCACGTC AAAGAGTAAA TCCTGCTCCC AGTCACTCTT AGGCTTAATA 360

ACAGCATTTT TTGCTCCTAA ATCATCAATA ATTAAGTAAT CAACAGACTT CATGAGTTCA 420

GTAGCTTCAA ACTCTGTAAG TGTTGCACCT TTACCATAAT TCCACCCCTC TTTAATTTGT 480

TTGATCATTT CGGTTAGGCT TACAAAAAGC ACACTCTTAG GTTCTCCTTT TGTCTTATAC 540

CCCTCATTTA TACCTTTGGC AATAGCAACT GATAAAAGTG TTTTTCCAAT CCCTGTACCT 600

CCTGTGATAA GCGTATTTCC CCTCATGCCA TCAAGATATT TTTGTACCTG ACCTTTTGCA 660

AATTCTAAAA ATCGCTTTTC TTCTGATGTT ACAGCATTAA AATCATCAAA AGTTTTAGTT 720

TTAAACTCAT CTGCTACATA GCTCTTATTG CTCATCAACA CATTATAAGT TTGCATATAT 780

AGTTTAGCAT TCAAATTATC AGCAATCGCA TCTTCTTCAT CTTGCTTTTT CTGTTCTTCT 840

TGGCATTGTT CACAATAGGG TGGGATACAG CGAACTTCTT TTATTGCCTC TCCGTTCTCA 900

TTCCACCCCA CTACTACATG TCTTTCTCCT TTGATTTGTG TTAGCTGTAT TTCATGCTTA 960

GGACACAATT CGTCTAGTTT AAATGTCTCA ATATTTCCTA AACTAGATTG TAATGATTTC 1020

ATTTTCTGAC CTCCTAAAAT GGTTTTTCTT GTGTTGGTAT CCAATCTTCA TAGCTGGTAG 1080

GCTCTAGTTG ATTGGTTTGC TGTTTTTTAG CCTCACGCGC TGCCCTGCTA TTTCTAACAA 1140

GTTCCACCGT CAATAAATTG TCCTGTTTCC AACGGTTAAG GATTACCTTG ATGTATGCAA 1200

AGTTTGCTTT ACCCTGACTG ACAGCCTCTT TTAACGCCTC ATGGATAAGC TCTGGGCTAA 1260

AATCTTCTAG CATATACTGC AATTCTTGAA TCTGTAACGG TGACAATGCT TTACCTGTCT 1320

CAGCTCGCTT CATATTCAAC AAGTCGTCTA TTTCCACACT GGTTACTTTT TTATTTACAA 1380

AATCAGAAAT CAGTTGAAAA ATGTTTGGAC TTTGTAGCTG GATTTCAGCC ATTACCTCAT 1440

CAAATTCTGC TTGTGTCATG TTGTCTAAAT CTAGTGTCAT TGCATTGCCT CCTCAAACTT 1500

CTCTATAAGA CAACTTTTAT TTGCTTTCTG AGTTCCATTT TTAGAGTTAA AAAGAATATC 1560

TTTTAAGGTT ACAGTAGCCT CTAAATACTC CTTTTCAGCA TGCTCTATAT ACGCCTGTTG 1620

CTCTGCTTCG TTCTCAAAAA AGTGCTTAGC TTGGCGTTTA AAGAATGCTT TTCGCATAGC 1680

GTCCATTTCA AAAATACCAG GGGCGAAAAA CATTCCCGTA GTGCTTTTAG AGACCGCTTC 1740

GATTTTATGG CTTTCATTCA ATTCAGGAAG TTCAATCCAA AGTAAACGGG ACAACTCATC 1800

TTTGATGGAT TTTGTCTGAC TTTCCAATAA AGAAAGGATT CTTAGGCCAT TTTCTTCGCT 1860

AATTTCTCGC ATTTCTGCGC TAATTCTGTC TATACGTCTA GTTAAATTCT CATATGTTGT 1920

TTCTGTCATG TTTTTACCTC TGTTTCTTTG TTGGTGTGAT TTTTTAGCTT ATTTTTTTAC 1980

TTCTAAACAT CATTGTCTTA ATTTCCTGAT AACTCATTTT CAATTCAATC ATAGCTATTG 2040

CCATATCCTC AAATGCCTGG TACTGCTCCA ACTCCTCACT AGTCAAGCTA TCGATACCGT 2100

TATAGCCCCC ACGCTCTTCT CTTAACTGCT TAGCGTTCAT GTCTGTTACT GCCTTTAGTA 2160

GCAAGTTGTT CATGGTGCTA TGCGCGTGCT TTGGTGCATT AGGCCATGTT TCTATACTGT 2220

CATGCAAGGT TTTTCTTTTC GGTTTTTCTA GCGCCCTCTG CAGACGAATT TCAGAAAGTT 2280

CCTCACGCAT TTCAAAGAAT GCTTTGACTA GGTTTAGTTT GAATTGCCGT ACTGTTTCGG 2340

TATTCTTTAA ATAAGTGATC AGAAAAGTAG CCTGTTGCTC GTTCAGAATA TAGGATTTTT 2400

TAGGTTGTCC TCTAGTATCT AATTTATGGA TTTTAAATCC AAGTATTCCC AACTCTTCAA 2460

AGTCAGCCTT ATTTTCTCTT ATTAAGCGCG TGATAGTGTG GTGTTGTACT TCAGCACATT 2520

CAGCGATGAT CTCGCTTGTG GTGTACGGCT CTTTCTTACC GTCCATGTAA ACTAGTTCCA 2580
TTACGGTTCT ACCTCCTGTA TAAATCTGGT TAGCTTACTT TTTAATTGCC TCCTCTAGCC 2640

TCTTTTTTAG CCTCTAAAAC GGCTTTGGCT AGTGGTTAAT ATTATTTACC ACTTGTCTCT 2700

ATAAACGTGT TAGAGGCCTT TATAACGACT TGTATCGCTG TATCGATATC CTCCGTGGAA 2760

TAGTAGATTT ATTTTCTAAT ATCATTCAAG ACTTGTTTAA CCCATTTCTT GAAAGAAATA 2820

AAATTACATC TTCTTTATCC TTGGCATCTG CTTTGTCTGA GACAAATTAG AATGTCAATA 2880

CTTGG 2885

(2) INFORMATION FOR SEQ ID NO: 224:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 3144 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 224:

TATCAATCCT TTCCCATTAT AGGAGCAACA GAGTGGGAGT AGTCATCTAA GGACTAATTT 60

ATGTATTTTT ACGAGTCAGT ATCTTGGGAT ACTGGTTTTT ACTTTTCTAG ACTTTTTGAC 120

TACTTGTTAA AACTGGGATA ATTTTCGACT GTTTAACAGT TATTATGCAA AGTCTAAAAG 180

ATTAGAATTG TCAAAACAAT CCGTCTAGGC TTGATTTTAT CCTTTATTTA CTATAAAATG 240

AGAAGGAAAA ATGTCAAACT TTTATATTGC AAATAGGAGA AATCATGACA AAAACATTAA 300

AACGTCCTGA GGTTTTATCA CCTGCAGGGA CTTTAGAGAA GCTAAAGGTA GCTGTTCAGT 360

ATGGAGCAGA TGCTGTCTTT ATCGGTGGTC AGGCCTATGG TCTTCGTAGC CGTGCGGGAA 420

ACTTTACTTT CGAACAGATG GAAGAAGGCG TGCAGTTTGC GGCCAAGTAT GGTGCCAAGG 480

TCTATGTAGC GGCTAATATG GTTATGCACG AAGGAAATGA AGCTGGTGCT GGTGAGTGGT 540

TCCGTAAACT GCGTGATATC GGGATTGCAG CAGTTATCGT ATCTGACCCA GCCTTGATTA 600

TGATTGCAGT GACTGAAGCA CCAGGCCTTG AAATCCACCT TTCTACCCAA GCCAGTGCCA 660

CTAACTATGA AACCCTTGAG TTCTGGAAAG AGCTAGGCTT GACTCGTGTC GTTTTAGCGC 720

GTGAGGTTTC AATGGAAGAA TTAGCTGAGA TCCGCAAACG TACAGATGTT GAAATTGAAG 780

CCTTTGTCCA TGGAGCTATG TGTATTTCAT ACTCTGGACG TTGTACTCTT TCAAACCACA 840

TGAGTATGCG TGATGCCAAC CGTGGTGGAT GTTCTCAGTC ATGCCGTTGG AAATACGACC 900

TTTACGATAT GCCATTTGGG AAAGAACGTA AGAGTTTGCA GGGTGAGATT CCAGAAGAAT 960

TTTCAATGTC AGCCGTTGAy ATGTCTATGA TTGACCACAT TCcAGATATG ATTGAAAATG 1020
GTGTGGACAG TCTAAAAATC GAAGGACGTA TGrAGTCTAT TCACTAyGTA TCAACAGTAA 1080

CCAACTGCTA CAAGGCGGCT GTGGATGCCT ATCTTGAAAG TCCTGAAAAG TTTGAAGCTA 1140

TCAAACAAGA CTTGGTGGAC GAGATGTGGA AGGTTGCCCA ACGTGAACTG GCTACAGGAT 1200

TTTACTATGG TACACCATCT GAAAATGAGC AGTTGTTTGG TGCTCGTCGT AAAATCCCTG 1260

AGTACAAGTT TGTCGCTGAA GTGGTTTCTT ATGATGATGC GGCACAAACA GCAACTATTC 1320

GTCAACGAAA CGTCATTAAC GAAGGGGACC AAGTTGAGTT TTATGGTCCA GGTTTCCGTC 1380

AtTTTGAAAC CTATATTGAA GATTTGCATG ATGCTAAAGG CAATAAAATC GACCGCGCTC 1440

CAAATCCAAT GGAACTATTG ACTATTAAAG TCCCACAACC TGTTCAATCA GGAGACATGG 1500

TTCGAGCTCT TAAAGAGGGG CTTATCAATC TTTATAAGGA AGATGGAACC AGCGTCACAG 1560

TTCGTGCTTA ATGTAGTTGT TTAGTTTTAA AAAACTATGC AAAGCTCCAT ATACAACACT 1620

TAAACGAGAT TAAAGAATGG CGAAATCCCT TGATGCGCAA GAGATTAGCT GTCTTTTTTA 1680

TTTTTTAAGT GATAAAGTCG GAGTTTAGGC ATCAAAGCCT ATCAAATTAA ACAAAGAAGC 1740

GATGTCTTAG ATATTTTGAA AAAAATTAAT AAGCAGAAAA CTCTCTATTA TTTTGTTGTA 1800

GAGAGTTTTT TGTTAATAAA ATTTCACAAA ATGACATTTA TATATTGCAT TAAGTTAGAT 1860

ATATGATATA ATATTGTTAA AAAGAGGCGC AACTTTTTAA AATTAATGAG AATCAAAGAG 1920

AAAACCAATA ATATTAATGG AGGAATAAAA AATGTAAGTA AGCATTATGG TCATTCAATC 1980

ATTCTCAAAG ATATAAATTT TGCACTTAAC AAGGGTGAAA TTGTTGGTCT AGCAGGGAGA 2040

AATGGAGTTG GTAAGAGTAC GTTGATGAAA ATTCTTGTTC AGAATAATCA ACCGACTTCA 2100

GGTAATATTA TAAGCAGTGA TAATGTTGGG TATTTAATCG AAGAACCAAA ATTATTTTTA 2160

TCTAAAACAG GTTTAGAGAA TTTAAAATAT TTGTCAAATT TATATGGTGT TGACTACAAT 2220

CAAGAAAGAT TTAGATGTTT GATCCAAGAG TTAGATTTGA CTCAGTCTAT TAATAAAAAA 2280

GTAAAGACCT ATTCTTTGGG TACAAAACAA AAATTAGCTT TGCTTCTAAC TCTCGTTACG 2340

GAACCTGATA TATTGATTTT AGATGAACCG ACTAATGGTT TAGATATTGA ATCATCACAA 2400

ATAGTTTTAG CGGTTCTAAA AAAATTAGCT TTACATGAAA ATGTGGGAAT TTTAATATCG 2460

AGTCATAAAT TAGAAGACAT TGAAGAAATT TGTGAGAGAG TTCTTTTCTT GGAGAACGGG 2520

CTTTTGACAT TTCAAAAAGT AGGAAAAGAT AGTCATAATT TCTTGTTTGA GATAGCTTTT 2580

TCATCAGCTA CAGATAGAGA CATTTTCATT ACCAAACAAG AATTTTGGGA TATTGTTTAG 2640

GAAGAGGGAT TGAGAATTAC TATGTCTGGG AATATTCAAA ATAGTGAGCT TTTTAAATTT 2700

TTTAACGAAA ACTCTATTAA AGTAGTTGAT TTTGAAACTA AAAAAGAGAC GCTTAAAGAT 2760

ATTTACCTAA ATCGTTCAAA ATAAAGGAAG GTTATAATCA TGAAATTAAA TAAACAGAAG 2820

AATCGGATGA TTTACGTCTT GTCTAATTTT CTATATGCTA TCTCAGTTTC CATTATTTAT 2880

GCTTTGAATG GCATTGTGTT ACTAGTCATA GTAAGTAAAT TGGGTATTCC AGGTGATTTA 2940

GGATTAAATT TTATAGTAGC TATTGTAGTC AATACAATTT TGTTAGTCCT GTTTTATTTT 3000

CTATTATCTT ACATTTTCTA TTTATACAAA TTGAAAAGTG GCTTGGTATw TGGTATTTTA 3060

GTAGCTTTAC TACTCTTTAT CTCTAATATA TTAAATACGA TGATGATGAA TACTAGTAAT 3120

GATTTGTTTA TCAAAGCAAT TGAA 3144


(2) INFORMATION FOR SEQ ID NO: 225: 









(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3766 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: double

(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 225: 
TACGGTATTA TTTTTAAGGA GAAAGAATCA TGAAAATCAA AAAATGGCTT GGTCTAGCAG 60

CCCTTGCTAC AGTCGCAGGT TTGGCTCTTG CAGCTTGCGG AAACTCAGAA AAGAAAGCAG 120

ACAATGCAAC AACTATCAAA ATCGCAACTG TTAACCGTAG CGGTTCTGAA GAAAAACGTT 180

GGGACAAAAT CCAAGAATTG GTTAAAAAAG ACGGAATTAC CTTGGAATTT ACAGAGTTCA 240

CAGACTACTC ACAACCAAAC AAAGCAACTG CTGATGGCGA AGTAGATTTG AACGCTTTCC 300

AACACTATAA CTTCTTGAAC AACTGGAACA AAGAAAACGG AAAAGACCTT GTAGCGATTG 360

CAGATACTTA CATCTCTCCA ATCCGCCTTT ACTCAGGTTT GAATGGAAGT GCCAACAAGT 420

ACACTAAAGT AGAAGACATC CCAGCAAACG GAGAAATCGC TGTACCGAAT GACGCTACAA 480

ACGAAAGCCG TGCGCTTTAT TTGCTTCAAT CAGCTGGCTT GATTAAATTG GATGTTTCTG 540

GAACTGCTCT TGCAACAGTT GCCAACATCA AAGAAAATCC AAAGAACTTG AAAATCACTG 600

AATTGGACGC TAGCCAAACA GCTCGTTCAT TGTCATCAGT TGACGCTGCC GTTGTAAACA 660

ATACCTTCGT TACAGAAGCA AAATTGGACT ACAAGAAATC ACTTTTCAAA GAACAAGCTG 720

ATGAAAACTC AAAACAATGG TACAACATCA TTGTTGCAAA AAAAGATTGG GAAACATCAC 780

CTAAGGCTGA TGCTATCAAG AAAGTAATCG CAGCTTACCA CACAGATGAC GTGAAAAAAG 840

TTATCGAAGA ATCATCAGAT GGTTTGGATC AACCAGTTTG GTAATAAGAA ACAGGGAGGT 900

GGGAGAGAAA ATTCCACCTC TTGCTTTTGT ATAGAGTATA GATTGTAAAG AAGACTATTC 960

GTTCATAGAA AGGTAGAGAG AATATGGTTT TTCCTAGCGA ACAAGAACAG ATTGAAAAAT 1020

TTGAAAAGGA TCATGTAGCC CAGCATTATT TTGAGGTTTT GCGTACCTTG ATTTCTAAGA 1080

AATCAGTCTT TGCCCAGCAG GTTGGACTCA AGGAAGTCGC AAATTATCTG GGTGAGATTT 1140

TCAAGCGTGT TGGAGCTGAA GTGGAGATTG ATGAGAGCTA TACAGCGCCC TTTGTCATGG 1200

CACATTTCAA GAGTTCGCGT CCAGATGCCA AGACCTTGAT TTTCTATAAC CACTATGACA 1260

CTGTGCCAGC GGATGGGGAT CAGGTCTGGA CAGAGGATCC kTTTACGCTT TCGGTCCGCA 1320

ATGGCTTCAT GTATGGGCGT GGGGTTGATG ACGACAAGGG TCATATCACA GCTCGCTTGA 1380

GTGCTTTGAG AAAATATATG CAGCACCATG ATGATTTACC TGTCAATATC AGCTTTATCA 1440

TGGAGGGAGC GGAGGAATCG GCTTCAACAG ACCTAGATAA GTATTTGGAA AAGCATGCAG 1500

ACAAACTCCG TGGGGCGGAT TTGTTGGTCT GGGAACAAGG GACCAAAAAT GCCTTGGAAC 1560

AGCTGGAAAT TTCTGGTGGC AATAAGGGGA TTGTGACCTT TGATGCCAAG GTAAAAAGCG 1620

CTGATGTGGA TATCCACTCG AGTTATGGTG GTGTTGTGGA ATCAGCTCCT TGGTATCTCC 1680

TCCAAGCCTT ACAGTCTCTT CGTGCTGCGG ATGGCCGTAT CTTGGTTGAA GGCTTGTACG 1740

AAGAAGTACA AGAGCCCAAT GAACGAGAAA TGGCCTTGCT AGAAACTTAT GGTCAACGAA 1800

ACCCAGAGGA AGTTAGTCGG ATTTATGGAT TGGAGTTGCC TCTCTTACAG GAGGAGCGGA 1860

TGGCCTTTCT AAAACGTTTC TTTTTCGATC CAGCGCTTAA TATCGAAGGA ATCCAGTCTG 1920

GTTATCAAGG TCAGGGTGTT AAGACTATTT TACCTGCAGA AGCCAGTGCC AAGCTAGAGG 1980

TTCGTCTGGT TCCGGGCCTA GAACCGCATG ATGTTCTGGA AAAAATTCGG AAACAGCTAG 2040

ACAAAAATGG CTTTGATAAG GTAGAATTAT ACTATACCTT GGGAGAGATG AGCTATCGAA 2100

GCGATATGAG CGCACCAGCC ATTCTCAATG TGATCGAGTT GGCCAAGAAA TTCTATCCAC 2160

AGGGCGTTTC AGTCTTGCCG ACGACAGCGG GGACAGGACC TATGCATACG GTCTTTGATG 2220

CCCTAGAGGT ACCAATGGTT GCATTCGGTC TAGGAAATGC CAATAGCCGA GACCACGGTG 2280

GAGATGAAAA TGTGCGAATC GCTGATTATT ACACCCATAT CGAATTAGTA GAGGAGCTGA 2340

TTAGAAGCTA TGAGTAGAGA TATTATCAAG TTAGATCAGA TCGATGTGAC TTTTCACCAA 2400

AAGAAGAGAA CCATCACAGC GGTTAAGGAT GTGACCATTC ACATCCAAGA AGGGGATATC 2460

TACGGAATCG TTGGATATTC TGGAGCAGGA AAATCAACCC TTGTACGGGT GATTAATCTC 2520

TTGCAAAAAC CATCTGCAGG GAAAATTACC ATTGACGACG ATGTGATTTT TGACGGCAAG 2580

GTGACCTTGA CGGCAGAGCA GTTGCGTCGT AAACGTCAAG ATATCGGAAT GATTTTCCAG 2640

CATTTTAACC TGATGAGCCA AAAGACAGCA GAGGAGAATG TAGCCTTTGC CCTTAAACAC 2700

TCTGAACTCA GCAAGGAAGA AAAGAAGGCT AAAGTAGCTA AGTTGTTGGA CTTGGTTGGT 2760

TTGGCAGATC GTGCTGAAAA CTACCCTTCA CAACTATCTG GAGGGCAAAA ACAGCGTGTG 2820
GCAATTGCGC GTGCCTTGGC CAATGATCCA AAAATCTTGA TTTCAGACGA GTCAACTTCT 2880

GCCCTTGATC CGAAGACAAC CAAGCAGATT TTGGCCTTGT TGCAAGATTT GAACCAAAAA 2940

TTAGGCTTGA CTGTTGTCTT GATTACGCAT GAAATGCAGA TTGTCAAAGA CATTGCCAAC 3000

CGTGTTGCAG TTATGCAGGA TGGGCATTTG ATTGAAGAGG GTAGTGTGCT TGAAATCTTC 3060

TCAAACCCTA AACAACCTTT GACTCAAGAC TTTATCTCAA CAGCTACAGG TATTGACGAA 3120

GCCATGGTCA AAATCGAGAA GCAAGAAATC GTGGAACACT TGTCTGAAAA CAGTCTCTTG 3180

GTGCAACTCA AGTACGCTGG AGCTTCAACA GACGAGCCAC TTTTGAATGA ATTGTACAAG 3240

CATTACCAAG TAATGGCTAA TATTCTCTAT GGGAATATCG AAATTCTCGA TGGTACTCCT 3300

GTTGGAGAAT TGGTGGTGGT TTTGTCAGGT GAAAAAGCAG CGTTGGCAGG TGCCCAAGAA 3360

GCCATTCGTC AAGCAGGTGT ACAACTAAAA GTATTGAAGG GAGTACAGTA AGATGGAATC 3420

ATTGATTCAA ACCTATTTAC CAAATGTCTA TAAGATGGGT TGGGCTGGTC AGGCAGGCTG 3480

GGGAACGGCT ATCTACTTAA CTCTTTATAT GACAGTTCTT TCCTTCATTA TCGGAGGCTT 3540

CTTGGGGCTA GTGGCAGGTC TCTTTCTCGT CTTGACAGCG CCAGGTGGTG TCTTGGAGAA 3600

TAAAGTCGTA TTCTGGATTT TAGACAAAAT TACCTCAATT TTTCGTGCGG TTCCCTTTAT 3660

CATCCTCTTG GCAATCTTGT CACCACTTTC TCACTTGATT GTTAAAACAA GTATCGGGCC 3720

AAATGCAGCC CTTGTCCCAC TTTCTTTTGC AGTCTTTGCC TTCTGG 3766

(2) INFORMATION FOR SEQ ID NO: 226:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 2520 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 226: 



TGTTGCTGAG TTAATCGGTA CGTTCATGTT 


TGTATTCGTC GGGACAGGAG CTGTTGTTTT 


60 


TGGAAATGGT CTTGATGGCC TTGGTCACCT 


TGGAATCGCC 


TTTGCCTTTG 


GTTTGGCAAT 


120 


CGTGGTGGCA GCCTACTCAA TCGGAACTGT 


TTCAGGTGCT 


CACTTGAACC 


CGGCTGTTTC 


180 


GATTGCTATG TTTGTAAACA AACGTTTGTC 


ATCTTCAGAA 


CTTGTAAACT 


ACATCCTTGG 


240 


TCAGGTTGTT GGAGCTTTCA TCGCTTCTGG 


CGCTGTCTTC 


TTCCTCTTGG 


CTAACTCAGG 


300 


TATGTCAACT GCTAGTCTTG GTGAAAATGC 


CTTGGCAAAC 


GGTGTCACTG 


TCTTTGGTGG 


360 


TTTCTTGTTT GAAGTCATCG CAACTTTCTT 


GTTTGTATTG 


GTTATCATGA 


CTGTGACTTC 


420 
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AGAAAGCAAG 
GATTCTTGTC 
AGCTGTCTTG 
GCTGGTGGAG 
AAACTCAAAA 
GAAAATCTCT 
GTTCTATCCA 
TCAAAACAGT 
AAGCTGACTT 
GTTCTATCTG 
TCAAAACAGT 
GATCTGACTT 
AACTTCCTAG 
CTGGATAAAG 
GCTTACCGTA 
AGACTTCTAA 
AGTTTTCGTG 
ACTCGACACA 
CAGTGATTTG 
CGATTTCTCT 
CTTCGACTGT 
CACTAAAGAC 
TCAGAGCACG 
GCGATGGCAT 
GGACAAATCG 
CTATTCTAGT 
AAAAAAGAGA 
GGCCTTGATT 
TTTGAAGAGA 
ATCAACGTCC 



GGCAATGGCG 
GGATTGAAGA 
GTAGGCGGCG 
TTCTTGCAGC 
AGCCTTGCTC 
TCAAACCACG 
CAACCTCAAA 
GTTTTAAGCT 
CGTCAGTTCT 
CAACCTCAAA 
GTTTTGATCT 
CGTCAGTTCT 
TTTGCTCTTT 
GTCGTGTTGG 
GTTGTAGTAG 
ATAGCGAGGG 
GAAATCTCCG 
GAGCTTGTCA 
TCCCAGCAGA 
AGTAATTTGA 
TTCATAGTGT 
GACCAAGGCT 
CAAAAAGCTC 
ATCCCAAACA 
ATCAGGACAG 
TTCAATCTAC 
CCAAAGAAAG 
TGTTCTGCTG 
AGAGTTGGAA 
ATTTTAACGA 



1242 

CGATTGCTGG TTTGGTAATC 



TTACTGGACT TTCAGTAAAC 
CAsCCTTCAA CAAGTTTGGA 
CCTTGTTGCA AAAAATTTCC 
CTCATCTTGA GGAACAGGGC 
TCAGCTTCAT CTTGCCGTAG 
ACAGTGTTTT GATCTGACTT 
GACTTCGTCA GTTCTATCTG 
ATCTGCAACC TCAAAACAGT 
ACAGTGTTTT AAGCTGACTT 
GACTTCGTCA GTTCTATCCA 
ATCCACAACC TCAAAACAGT 
GATTTTCATT GAGTATGACT 
AAGAGGCGTT GTTCTGCCAA 
GGGTCGATTG AAATGCCACC 
TCTAGCAAGT TGACCAAGTC 
TGGTTTCGGT AGCTAAATAG 
GGAATGTAGG AAATATGAAT 
GACATATCGA GGATATGGTG 
ATTTCGAGGT GATGACGTTG 
TGCATGACCC AGAAAAGGCA 
AATTGACGTT TCATAGTACT 
CCATTAGGGA GCTAAAAAAT 
TCGTAATATT CTACTTATAT 
TCAAATCGAT TTCTAACAAT 
TATAGTCTAG CATATTTTTT 
TACTTGGTCT CTCGTTTGAT 
TGTGAACACC TGCAACTTGT 
TAGACATGAT TCCAAAAGCA 
TTTTCAAGAC ATCTTCTGAA 



GGTTTGTCAT TGATGGCGAT 480 

CCAGCTCGTA GCTTGGCACC 540 

TTTTCATCCT TGCACCAATC 600 

TTGGAACAGA AGAATAATTG 660 

TTTTTCGTAT GATACTCTTC 720 

TATGGTTACT GACTTCGTCA 780 

CGTCAGTTCT ATCTGCAACC 840 

CAACCTCAAA ACAGTGTTTT 900 

GTTTTAAGCT GACTTCGTCA 960 

CGTCAGTTCT ATCCACAACC 1020 

CAACCTCAAA ACAGTGTTTT 1080 

GCTTTGAGCA ACcTGCGGCT 1140 

TTAGCGGTTG TCAATTTTCT 1200 

GCCCTCATAC TTAGTTCCTT 1260 

GCGCGGAGTG AATTTTCCCC 1320 

TTTCCCGATG GTGTTGATAC 1380 

ATATAGTTTG AGGGATTTTG 1440 

CGTCGCAAAG TCTGGCTGAG 1500 

ACGAATGCCC TGTTCCTTAG 1560 

GCCGTAGGCA AAGGTGACAG 1620 

GGTTGTTGAA TCTTGACCAC 1680 

CCTTCCAAAA TGGGAAATGT 1740 

ACCAAATCGA GGTTTTTTTA 1800 

AGTAAAATGA AATAAGAACA 1860

GTTTTAGAAG TAGAGGTGTA 1920 

GAAAAATGGC AAAGGGCAAG 1980 

TAGCTCAATT CAGCAATGAT 2040 

TTGACAACTT GGCCGTCTTT 2100 

CGAGCTGTGT TTGGATTTTC 2160 

AGTTCTTCAG ACAATTTGTC 2220 
CAAGATTGGA CCTTGCATAC GACATGGACC ACACCAAGTT GCCCAGAAGT CTACTAAGAC
CAAACCGTCT TTTGTTTCTT GTTCGAATGT TGCATCTGTA ATTGCTTTTG CCATTGTATT 
TCTCCTTTTT TTAGTTATAT TGGCTTAAAT CTTGTTTCAT GAGATAGAAG AAGATATCTC 
CATAAGTCCC ATGGTAGTCC AAATTATGAC CCTTGTAAGT TAATTTTTGG ACAGGGTAGT 
AkkCTGCGAC GCCGATAAGG CAAGCTTGTT GCGAACGTTC AAAGTCTTCA TAAGACTCGG 
(2) INFORMATION FOR SEQ ID NO: 227:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5278 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 227:
ACTCAGTTAG ATTTTGTTTT CAAAAACAAC GAAGAAAAAG ACCATGTTGC TCTACTTGGA 
AGAATTGGCT CCGAACGTGT TTATCGATAT ATTAATAAAA AATATTTAGA TTTACCGGAA 
ACATTCGAAA ATTATAATGT TTTTGTACCA GAAGCTAATG GAAGTGGTGC CTTAGGTGAA 
GTCTTATCAA CACCCCTAAT CGGGGAACCC CTAATCGGGC ATACAGATAC TTTTTTATCT 
ATTGGTAATT TTAAAACAAA ATTTGAAGCC GATGCTTGTA TTAAATTTAT TAAAACTAAA 
TTCGCTAGAG TATTATTAGG TGTTTTGAAA GTTACTCAGC ATAATTCACG CAAAACTTGG 
TATTACGTCC CCCTCCAAGA CTTTACGGTC AATTCGGACA TTGATTGGAC ACAATCAGTG 
ACTGATATTG ACCGCCAGCT TGATCAAAAA TATGACTTTT CCCCTGAAGA AATTGCCTTT 
ATTGAGAATC ATGTAAGGGA GATGGATTAG AAAAGTATTT TTATTTGACA AATAGTGCTC 
AATGATCTAA AATGACTATA TAGGATTAGG TCAGGAAGCA TACGATGCCC TGACCCTTTT 
TGTACTTATG AGATGAGAAA GTCATTTGTT AGATAAATTG ACTCGTTAGC AAACGTTCAA 
AAAAGGAAAA CTTATGCCAG TAGAAATTAA AACCACTAAA GAAATTCATC CTAAAATCTA 
TGCCTACACC ACACCGACAG TAACCAGTAA TGAAGGCTGG ATTAAGATTG GGTATACAGA 
ACGTGATGTC ACACAACGTA TCAAGGAGCA AACGCATACA GCTCATATAG CTACAGATGT 
CTTATGGACT GGTGATGCAG CTTATACAGA AGAGCCTGAT AAGGGGAAAA CTTTCAAGGA 
CCATGATTTC CACCATTTCC TTTCTTTCCA TGATGTAGAA CGTCGTCCCA AGACGGAATG 
GTTCTATTTT AATGGAACTC CTGAAAAATC AAAAAATCTT TTTGATAAGT TTGTTCAGCA 
TGATTTGTCT GGTTATCAGC CTGGAAAAGG ACAGGACTAT ACTCTGCGAC AAGAGCAAGA 



2280 
2340 
2400 
2460 
2520 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
AGAAGCAGTT GCTAAGACAT
GAATGCCAAG CCACGCTTTG 
AGCTGTCAAT GTCCTAATTG 
TTTTGAAACA TTCATAGCAG 
TAAGAGTCGT CCAATCTTGT 
ACAACTTGCT TTTATCAGTC 
"CGATAAACTC AAATGGGTAA 
TGAAGGAGTT GATACCTTCA 
TCTGCATTTG TCAGGTACAT 
AATCTACAAC TGGTCTTATG 
AGAAGAGGAA AATCCTTATG 
TCAGATGATT GGCGAAAAGT 
TGTTTTTGAC TTAAGTGAAT 
TGATGTCAGA AATTGGTTAG 
AGAACTCCGT AATGAACTCA 
AGCATTAAAA GCCCTACTAG 
TGCTGGTGAC GGACGTATGT 
GGTTAGAAAA GCGATAGCAG 
GACAGGTGTC ACTATCCCTG 
AGCTCTTTAT ATGCAGGCCG 
AGGAAATCAC TTTCGCAAAG 
GATTCTCTTT GATGAGTTTG 
TTCAGCTACA CGCGAAGAAA 
AGACCGTGCT GGTAAGATGG 
GATAAAAGCT AGAGAAGTTC 
TAGTGGTATT TTCCAAGCAA 
AAAGGAAGGG AAGGTACAAG 
TGATGAGGGA AATGCAGTAG 
TGGTGAAAAA GTTTATGGAC 
ACGAACTCAA AAACAGCTGG 



TAGCTTATTT 
GTAAAACCTT 
TAACAAACCG 
GTCAAACGAC 
CACGACAAGA 
TCCAAGACTT 
CTGATCTGCA 
AGACTGACCA 
CATTTAAAGC 
CTGATGAGCA 
AAAGCTTGCC 
TAGAAAAAGG 
TTTTCGCTAC 
ATACTCTATC 
AGCATACTTT 
AAGAACACCC 
CCGAAGAAGA 
AGAATGACAA 
AATGGACAGG 
CCTTCCGTGC 
AAAGAGCCTA 
CCAACAACTT 
ATATTAGAGA 
TTGAAATTGA 
TTAAACGAGG 
GTCAAACAGT 
ATAGTTCTGA 
TAGACCATGA 
TTGGTGAATC 
TCAATGACTT 






CCAAGAACAT GCTGGAGGCA AGTTTCTCTG 1140 

GTCTACCTAT GACCTAGCTC GACGGATGGA 1200 

CCCTGCCATT GCTAACTCAT GGTATGATGA 1260 

TTACAAGTTT GTTTCTGAAT CAGATAGCCT 1320 

ATTTCTTGGT ATTTTAGCTG ACGATGTAAG 1380 

GAAAGGATCT GTTTATTTAG GTGGAGAGCA 1440 

TTGGGACTTG TTGGTTATTG ACGAGGCTCA 1500 

AGCCTTTAAT AAGATTCGAC GAAATTTTAC 1560 

ATTGGCTAAA GGAGATTTTA CAGAGGAACA 1620 

GGCTGCTAAG TATTCGTGGT CTCTTGAGCA 1680 

TCAGTTGAAT CTCTTTACCT ATCAAATGTC 1740 

CGCTCAGATC GATGGTGAAA ATATTGACTA 1800 

AGATGATAAA GGGAAATTTA TTCATGAGCA 1860

AAGCAATGAA AAATATCCAT TTTCAACCAA 1920 

TTGGCTTTTA GAACGTGTCG CTTCGGCCAA 1980 

AATCTATGAA AACTATGAGA TCGTTCTAGC 2040 

CGATAAAGTC AAACTCAAAT CCTTGGACTT 2100 

AACCATTACC CTATCCGTTG GTCAGCTGAC 2160 

TGTATTGATG TTATCAAATT TGAAATCACC 2220 

TCAAAATCCT TACTCATGGA GCGATAACAA 2280 

TGTATTTGAC TTTGCGCCGG AAAGAACCTT 2340 

ATTGCTTGTA ACTGCAGCTG GTAGAGGAAC 2400 

ATTATTAAAC TTCTTTCCAA TTATTGCCGIA 2460 

TGCAAAGGCA GTTCTAACCA CTCCTCGCCA 2520 

TTTTATGTCC AATCTCTTAT TTGATAATAT 2580 

TTTAGATATT TTAAATGAGC TGCCAGTTGA 2640 

TTTATTAGAT TTTTCAGATG TTACAGTCGA 2700 

AATTGTAGTT AATCAGCAAA TGCGACTTTT 2760 

TGTTGCTGAG TTAGTCACAA AAGATGAGGA 2820 

GAGTAAGACC GTTTCTTCAG TGATTGTAGA 2880
GGAATTGAAA GCAGATTATT CTCTAAAAAC AAGGGAAACT GAGCAAATTA AGAAACAAAT
TACAGCAACA CTTGAGAATG AAATTCGAAA AAATGATATC GAAAGAAAAA TTTCTGAAGC 
TCATATCAAG CAAGAGTTGC AACAGCAGCT CAAAGAAGCA AATGATAAAG CGCAAAAAGA 
TAAGATTCAA GAAGATTTGG AAAAACGTTT AGAAGAAAAT AAACTCATTC ATAAAGAAAA 
ACTAGAACAA ACACTCAAAA AAGAAGTGGA AAAAATGCCT GAGAAATTTA TCGAACAGGT 
TGAGATAAAA CGTGTGGAAC AGTTGAAACA ATCAGCTCAA GATGAAATTC GTGACCATTT 
ACGAGGGTTT GCAAGAACAA TTCCAAGTTT TATTATGGCT TACGGTGATC AAACTCTAAC 
ACTTGATAAT TTTGATGCCT TTGTTCCTGA ACATGTTTTT TATGAAGTAA CAGGGATTAC 
GATTGATCAG TTTAGATATT TGCGAGATGG TGGGCAGGAT TTTGCAGGGC ATCTCTTTGA 
TAAAGCAACA TTTGACGAAG CTATTCAAGA ATTTCTTCGC AAGAAAAAGG AGTTGGCGGA 
TTATTTTAAA GATCAAAAAG AAGACATTTT TGACTATATT CCACCGCAGA AGACCAACCA 
AATTTTCACT CCTAAACGAG TGGTGAAAAG GATGGTAGAT GATTTGGAAA AGGAAAATCC 
AGGGATTTTT GATGATCCAT CTAAGACTTT TATTGATTTA TATATGAAGT CAGGCCTCTA 
TATTGCAGAA CTTGTGAAGC GGTTATATAA TAGCAATGGC TTGAAAGAGG CCTTTCCAAA 
TCCTGAAGAA CGCTTAAAAC ATATTTTGGA AAAGCAAGTT TATGGATTTG CTCCGTCTGA 
GATTATCTAT AACATTTCCA CTAATTTTAT ATTTGGCAAT CTTTCTAAAG ATATCAGTAG 
GAAGAATTTT GTTTTAGCAG ATACCATTCC AGCGGCTAAA GAAGGGAGCA TTCAAAAGTT 
GGTTGATTCC TATTTTGAAA ATAATTAAAA AGAAGGCCGA GTCAAAATTC TTTGAAATCA 
GAAAAAACGC ATAATATTGA GTGCTTTTGT ACTGCCCCCC AAAAGTTAGA CAGAAAAAAT 
CTAACTTTTG GGGGGCAGTT CAGACAATCC TTGGTATTAT GCGTTTTATT GTGGGAAGAT 
GTATAATGGA TTGAAATAAG ATATGAACAA ATCAATTAGG AATTTAAAGC ATTTTATAAC 
AACGTTTTAG AGTAATGGGG GGCTATTTCA ACTTCAACCT ACTATAATAC AGAAAAAAAC 
AACTCCCTGA TAATTCAAGG AGTTGTCTAT AGTTAAATTA GTTTTTAGAA GCTTCTTGGA 
ATTCTGGGTT TTTCCATGCT TCGTCAATGA TAGCTTGTAA TTCTTTAGCA GATGCTTGCA 
TTTTTTGAGT TTCTGCGTCG TTCAATGGGA TATTTACTGG ACGAACGATA CCATGTGCAC 
CAACAACAGC TGGTTGACCG ATAAAGACAT TCTCAACTCC GTATTGACCT TCTTGGAATA 
CTGAAAGTGG AAGTACTGCG TTTTCATCGT CAAGGATTGC TTTAGTGATA CGAGCAAGGG 
CTACTGCGAT ACCGTAGTAT GTTGCACCTT TTTTGTTGAT GATTGTGTAG GCTGCATCAC 
GAACACCTTC GAACAATTCA ATCAATTCAG CTTCTTGAAC ATTTTGAGTG TCTTTAAGGA 



2940 

3000 

3060 

3120 

3180 

3240 

3300 

3360 

3420 

3480 

3540 

3600 

3660 

3720 

3780 

3840 

3900 

3960 

4020 

4080 

4140 

4200 

4260 

4320 

4380 

4440 

4500 

4560 

4620
ATTCTTCAAG GTTTACACCA GCGATGTTAG CGTCTgIcCA AACAGCGAAC
CGTGTTCACC CATGATGTAG GCGTGCACTG AACGAGCATC CACATCCAAT 
GTGCTTGACG GAAACGAGCT GAGTCAAGTG AAGTACCTGA ACCGATAACG 
GGAAACCAGA GAATTTCCAA GTTGAGTAAG TCAAAACGTC AACTGGGTTA 
GGAAGATACC TTTGAAACCA GATTCAACAA CTTGAGTTAC GATTGATTTG 
GGTTTTTACC TACAAGGTCA AGACGAGTTT CACCTGGTTT TTGAGGTGCA 
TCACAACAAG GTCAGCGTCT GCACAGTCAG AGTATTGAGC TGCATAGATT 
AAGTGAAGGC AAGGGCGTGA CTAAGGTCAA GCGCATCACC AACAGCTTTT 
GTGGAATTTC GATAATTCCA AGCTCTTGTG CAATTCCTTG GTTAACAAGT 
AAGATGAACC TACAGCACCA TCACCGACAA GGATAACTTT TTTGTGTTGT 
TCATTGTTTT AAACATCTCC TTAATTTTAT TAGGGGATTT TCCCTAGACA 
(2) INFORMATION FOR SEQ ID NO: 228: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1941 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear 



TCAGAGTCAC 
TTTTCAGCAA 
CGTTCTTTAG 
GCAGCAACAA 
TTGATAGCAA 
CCTGCAGTGA 
TTTTTAGGTG 
TCATGCAATT 
GCAAAAGCGT 
TTAGTTGAAG 
ACTTCATT 



4680 

4740 

4800 

4860 

4920 

4980 

5040 

5100 

5160 

5220 

5278 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 228:
ATAAGGAATC TCTAAAAAAT TTTAAGGAGA ATCTAGCAAA TGGATTTCAC ATGGGCACTG 
AAGTATGCCA CTGAATTTTT GGGAACTGCC ATTTTGATCA TTCTTGGGAA TGGTGCAGTT 
GCCAACGTTG AACTTAAAGG TACGAAAGGT CACCAAAGTG GCTGGATCGT CATCGCTGTT 
GGTTATGGTA TGGGGGTTAT GATCCCAGCC TTGATGTTTG GTAACGTATC TGGGAATCAC 
ATCAACCCTG CTTTCACTCT AGGGCTTGCA GTTAGCGGTC TTTTCCCTTG GGCACAAGTG 
GTACCTTACA TTATCGCGCA AGTCTTGGGG GCTATCTTTG GCCAAGCCTT AGTTGTGGCA 
ACATACCGTC CATTCTACTT GAAAACTGAA AACCCAAATA ACATCTTGGG AACTTTCTCA 
ACTATTTCAA GTATTGACCA TGGTACAAAA GAAAGTCGCT ATGCAGCAAC TGTCAATGGT 
TTGATTAATG AGTTTGTTGG TTCATTTGTT TTGTTCTTTG CAGCTCTTGG TTTGACTAAA 
AACTTCTTTG GTGCTGAAGT GCTTCAATTC ATGAAACAAA AGGCAACAGA AGCAGGACAA 
ACAGTTGATT TTTCTGACTT GGCTATTAAA GCACAGGTGG CTCCACACAC TGCTTCAGGA 
CTTTCTGTGG CTCACTTGGC ACTTGGATTC CTCGTTATGG CTTTGGTAAC ATCACTTGGA 
GGACCTACAG GACCTGCCTT GAACCCAGCC CGTGACTTGG GACCACGTCT CCTTCATGCT 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
TTCCTTCCCA AATCAGTTCT TGGTGAGCAT AAAGGCGATT
GTACCAGTAG TAGCACCTAT CGCAGCAGCA ATTGCGGCAG 
TATCTCTAAG AAATAGCTCC TTTAACATTT GAGTGAGCAC 
TCAGACTGGk TCTCTCTTTT kGATTTTTaG GGAAATGAAA 
TCCAGCAGTG GTTTAGAAGT CTCAGTGGGC TATTCCAGCT 
GCAGTTGAAA TAATAGACCC TTGTTTCTAA AACATTGTGA 
AATCAAATTG TGCAGTTTTC ATTCTACTAT ATATTATCGG 
CCCTATCTTG TAAGTCTGCT TTATAGTGGG TTGAAGTTGG 
CAAACATTGT GAGGAATTGA TTTACCTTCC TCAACAAAAT 
TACTATAAAA TAAGCGATTA GGGGGGCTAT TCTTCGACCT 
TATGATTGTT ATCGTTTTAT CTGCAATTTT ATACTCAATG 
GAAGCTAGCC GCAGGTTGTT CAAAACACAG TTTTGAGGTT 
AATAGTACAC ATCTACTTCT AAAACATTGT TAGAAATCGA 
TGCCCTATTC TTGTTTCATT TTACTATATA AACCAGAGAC 
GAGTGGATGG ATAATGCTGA AAACTCCTTG AAGGATAAGT 
ATTAGTTAAA TTTTTACCAA GAATAATTCA CAAAAACGTT 
CTGAAATTTG ATAAAATAGT AAGGAAAGTT AGACTGTATT 
ATATATTTTA TTGGAGGCTT TTACTCAAAT GGCAAAAGAA 
ACACGTTAAC ATTGGTACTA TCGGACACGT TGACCACGGT 
TATCACAACT GTTTTGGCAC G 
(2) INFORMATION FOR SEQ ID NO: 229: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 755 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



CAAAATGGTG 
TAGCTGTATT 
CATCTATAAG 
GAAcTCTAAA 
TCAATGGACT 
GAAATTGGTT 
AATATTATCG 
AATAGTCCTC 
GTTCAGTTTC 
ACATTGACTC 
AAAATCAAAG 
GTATAGTAGA 
TTTGACTGTC 
TGTTTACATT 
CTATTTAGTA 
GTAAAACACT 
GCCTACTGTC 
AAATACGATC 
AAAACTACCC 



GTATTCTTGG 
CAAATTCCTT 
TAAGAGAGGA 
CAAACTCCTC 
ATAGTAGGTT 
TGAATTCTCC 
GAGATGGGTT 
CCTTCTTTCT 
TATTTCATTT 
TGCTGAGTCC 
GGCAAACTAA 
TTGAAACTAG 
CTGAACGATT 
TTCAGCAAGT 
CTTTCTATTA 
TGCAATTTAG 
TATCTATAAA 
GTAGTAAACC 
TAACTGCAGC 



840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1941 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 229: 

ATTTGAAGAA ATTGAAGAAA TCGTAGCCCC TACAGATGGT GAATTTTTGG GGGAAGTTTT 60 

ACTTGGAACT GGGGTAGTTC TCTTAATTGG AGTAGCCTGT TGTTAAAAAG ATAGGGAGTG 120 

ATAATCATGC AAGATAACTT TTTATTTGAG GAAATTGAAG AAATTTCAGT ACCAGTTAAT 180 
240
300 
360 
420 



600 
660 
720 
755 



GATTTTTCAG CTGGACTTGC AACAGGTATC GGATTTGGTT TAGCAATCCT TGCTCTTGCT 

GGTTGTTGAA GTTTGTTCAT TTACTAACAT CAAGCTTTTT CAATTTCATT TTAGACAGTC 

ATTTAAATTT TCCGTATTAG TCTTGCAGCA AGAGATTAAT AGAATTAGTC ATTATTTTAT 

TGATTGCGGA CTGAGGGACT AGAGTATGTT TTACTTAACC CCTCTTTTAT TTATTAAAGG 

TTAGGTTTGT TATGAGAATT GTTGATAAGA TTAAGATATT ACCTACTCCT TATGAGGGAC 480 

ACTATCATTT ATATATACCA TCCAGTAAGA AACATGTATT AGTTGGGAAA CAGGAAAAAA 540 

ATGGTTAGAG CAACTAATAG GTCAAGAATT TACCATATCG GACTTATTAG TGTTAGTAGG

GAAGAAATAT TTTTAAAATA TCTTGGGACT TTAATATAAC ATTATCTGAA AAATTAAACT 

ATAAAAGATT TAATAAGAAT TTTGAAAAAA TCCTATCTTG TTGTCATTAT ATTTGCAACG 

ATACATGAAA TTAGTCATGC AATAATTGCT AATAA 

(2) INFORMATION FOR SEQ ID NO: 230: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1483 base pairs 
(B) TYPE: nucleic acid

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 230: 
CCAGAAAAAC CGTAGTGGAG CTCGTGGAAC AGTGGAATTG ATTTTCCAAA AAGAATACAA 
TAAATTTTCA AGTATCTCAA AGAGGGAGGC ATAAGATGTC AGATGCATTT ACAGATGTAG 
CCAAGATGAA AAAAATCAAA GAAGAAATCA AGGCACATGA GGGACAAGTC GTAGAAATGA 
CTTTGGAGAA TGGTCGTAAG CGCCAAAAAA ATAGATTGGG TAAGCTAATT GAAGTTTATC 
CATCTCTATT TATTGTGGAG TTTGGGGATG TGGAAGGAGA TAAACAAGTT AATGTTTACG 
TTGAATCCTT TACTTACTCA GATATTCTTA CAGAAAAGAA TTTGATTCAT TATCTTGACT 
AAAGTGAGAA ATTTTCTCAC TTTTTCTTTT TTCTCCGAAT AATTTAGGTG AAGGCAATCA 
TCGCTTTATA TTATTTTTCA AGGAGGAAGA ATGAAAATTT TACCGTTTAT AGCAAGAGGA 
ACAAGTTATT ACTTGAAGAT GTCAGTTAAA AAGCTTGTTC CTTTTTTAGT AGTAGGATTG 
ATGCTAGCAG CTGGTGATAG TGTCTATGCC TATTCCAGAG GAAATGGATC GATTGCGCGT 
GGGGATGATT ATCCTGCTTA TTATAAAAAT GGGAGCCAGG AGATTGATCA GTGGCGCATG 
TATTCTCGTC AGTGTACTTC TTTTGTAGCC TTTCGTTTGA GTAATGTCAA TGGTTTTGAA 
ATTCCGGCAG CTTATGGAAA TGCGAATGAA TGGGGACATC GTGCTCGTCG GGAAGGTTAT 
CGTGTAGATA ATACACCGAC GATTGGTTCC ATTACTTGGT CTACTGCAGG AACTTATGGT 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
720
780 
840 
CATGTTGCCT GGGTGTCAAA TGTAATGGGA GATCAGATTG AGATTGAGGA ATATAACTAT
GGTTATACAG AATCCTATAA TAAACGAGTT ATAAAAGCAA ACACGATGAC AGGATTTATT 
CATTTTAAAG ATTTGGATGG TGGCAGTGTT GGGAATAGTC AATCCTCAAC TTCAACAGGC 
GGAACTCATT ATTTTAAGAC CAAGTCTCCT ATTAAAACTG AACCTCTAGC TAGCGGAACT
GTGATTGATT ACTATTATCC TGGGGAGAAG GTTCATTATG ATCAGATACT TGAAAAAGAC 
GGCTATAAGT GGTTGAGTTA TACTGCCTAT AATGGAAGCT ATCGTTATGT TCAATTGGAG 
GCTGTGAATA AAAATCCTCT AGGTAAtTCT GTTCTTTCTT CAACAGGTGG AACTCATTAT 
TTTAAGACCA AGTCTGCTAT CAAAACTGAA CCCCTAGTTA GTGCAACTGT GATTGATTAC 
TATTATCCTG GAGAGAAGGT TCATTATGAT CAAATTCTCG AAAAAGACGG CTACAAGTGG 
TTGAGTTATA CGGCTTATAA CGGAAGTCGT CGCTATATAC AGCTAGAGGG AGTGACTTCT 
TCACAAAATT ATCAGAATCA ATCAGGAAAC ATCTCTAGCT ATG 
(2) INFORMATION FOR SEQ ID NO: 231: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1027 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 231:
CCCGGAAAAC AAGTTAAAGT TGAAGTTGGT CAAGCAGTTT ACGTTGAAAA ATTGAACGTT 
GAAGCTGGTC AAGAAGTTAC TTTTAACGAA TTGTTCTTGT TGGTGGTGAA AACACTGTTG 
TCGGAACTCC ACTTGTTGCT GGAGCTACTG TAGTTGGAAC TGTTGAAAAA CAAGGAAAAC 
AAAAGAAAGT GGTTACTTAC AAGTACAAAC CTAAAAAAGG TAGCCACCGT AAACAAGGTC 
ACCGTCAACC ATATACAAAA GTTGTCATCA ACGCAATCAA CGCTTAATTT TAAGGAGAAC 
ACATGATACA GGCAGTCTTT GAGAGAGCCG AAGATGGCGA GCTGAGGAGT GCGGAAATTA 
CTGGACACGC CGAGAGTGGC GAATACGGCT TAGATGTCGT GTGTGCATCG GTTTCTACGC 
TTGCCATTAA CTTTATCAAT TCTATTGAGA AATTTGCAGG CTATGAACCA ATCCTAGAAT 
TAAACGAAGA TGAAGGTGGC TATCTGATGG TTGAAATACC AAAAGATCTT CCTTCACACC 
AGAGAGAAAT GACCCAGTTA TTCTTTGAAT CATTTTTCTT AGGTATGGCA AACTTATCGG 
AGAACTATTC TGAGTTCGTC CAAACCAGAG TTATCACAGA AAACTAACAC GGAGGAAAAC 
ATTATGTTAA AAATGACTCT TAACAACTTG CAACTTTTCG CCCACAAAAA AGGTGGAGGT 
TCTACATCAA ACGGACGTGA TTCACAAGCA AAACGTCTTG GAGCTAAAGC AGCTGACGGA
CAAACTGTAA CAGGTGGATC AATCCTTTAC CGTCAACGTG GTACACACAT CTATCCAGGT 
GTAAACGTTG GTCGTGGTGG AGATGATACT TTGTTCGCTA AAGTTGAAGG CGTAGTACGC 
TTTGAACGTA AAGGACGCGA TAAAAAACAA GTGTCTGTTT ACCCAATCGC TAAATAAAAA 
GGTCCATTGA ACCTTTTATC CCGAACCTTG AAATGTAGAG GTGAGGAAGC TAGAAACAGC 
TTAAAAT 

(2) INFORMATION FOR SEQ ID NO: 232: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1990 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
(D) TOPOLOGY: linear



780 
840 
900 
1027



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 232:
CGGTTCAAAT GGTGCAGGTA AATCTACGTT AATTAATTCT ATTGTAGGTT TTCAAGAGAT 
TTATTTAGGA GAAATAGAGT ATTGTGATAA AGATTTGATA GTTAGTTCTC AACCTTTTGC 
TCATTTAGGC TTTACTCCTC AAACCACAGT AATTGATTTT TATACTACTG TGAAGGACAA 
TGTAATATTG GGGCTGAACC TTGCTGGAAA GTTTGGGAAA AATGCTGAGA AGTTGTGTCA 
AATAGCCTTA GAAATTGTTG GGTTAGCTGA TAAAAAAAAT AATTTGGTAG AAACATTGTC 
AGGTGGACAA CTGCAACGCG TCCAGATTGC TAGAGCAATA GCTCATAATC CAGATTTTTA 
TATTTTAGAT GAACCTACCG TTGGTTTAGA TACTGAATCT GCCGAAAAAT TTTTAATGTA 
TTTAAAAGAT AAGAGTTTGG AAGGAAAAAC TATTATCATA TCTTCACATG ACATAAATCT 
ACTCGAAAAG TTTTGTAAAA AAATACTTTT TTTACAAAAT GGCTCCATAT CATTTTTTGG 
TGATATGCGT GACTTTGTAG ATAATTCAAC TATCAAATTA AATTTTTCAA TGCAGAATAG 
AATTTCTAGA TATCAAATTG AATTTTTAGA AAATTTTAGA TTTAAAGTTC ACATCGAAGA 
TAATGATAGT TTTACAATAG AAGTCCCTAT AGAAGAAAAG ATCTTAGATG TTATCAATGA 
GGTAGGAAAA GCATGTGAAA TTAAAAACTT TTCAACAAGT AAATTAACCT TACAAGAAAG 
TTATTTGCAA AGAATAGGAG GAGAAAAATG AAGGCTGATC AATTAAGGCA CAAATCGGAC 
TTAGGTTTAA GAGGTCTAGC GATTATTGCT AAAAATGAGA TTATTGCTTT TTTTAGAAGT 
AAAGGTTTAA TTATTTCTCA GTTTCTACAA CCAATCTTAT ATGTTGTTTT TATAATAATA 
GGATTAAATT CTTCGATAAA GAACATTCAG TTTAATGATA TAAAAACCTC TTATGCAGAA 
TATACAATCA TTGGTGTTAT AGCTTTATTG ATAATCGGGC AGATGACTCA AGTTATTTAT 
AGGGTGACAA TAGATAAAAA ATATGGGCTA CTTGCTCTTA AGTTATGCAG TGGAGTTCGT
CCTTTATATT ATATTTTAGG GATGAGTATC TATTCTATAT TAGGGTTGAT AGTTCAAGAA 
ATTATTATAT ATATAATTAC GTTAGCGTTT GAGATAAATA TCGCAATGGA TAGATTTTTT 
TATACAGTTT TGTTATCTAT TGTTGTTTTA TTATTTTGGG ACTCCCTTGC AATTTTACTT 
ACAATGTTTA TCAATGATTA CAGAAGACGT GATATTGTAA TACGTTTTGT ACTAACACCG 
CTTGGTTTTA CAGCTCCTGT TTTCTACTTA ATAGATTCTG CTCCTAGTAT TGTGAGATGG
ATTGGTCAGT TAAATCCCTT AACTTATCAA TTAACTATTT TGAGAAACTT TTATTTTAAA 
AATTCAACAA CTTTGGAATT AGTTTTCTTA TTGTTAACAT CATTACTTGT CCTTATATCT 
GTATCTTTTA TTATACCAAA GATAAAATTG ATACTGATAG AAAGATAAAA GTTGGGTCAT 
CCAACTTTTT TGTTGTCTCC CGAAAACCAC TAGCTATGCT AGTGGTTCCA TAGAGCTTTT 
AGCGTGGTAA CAAAAAGAAC CTCCTAAAAT GATAAGATAG AAGTGGTTTC TCCGCCACTA 
CAACATATCA TACAGGAGGT ACCTCATGAG AGAGGATAAT CAAAGTTTAT CACATACCAC 
ATGGAATTGT AAATATCATA TTGTTTTTGC ACCCAAATAT CGTCGTCAAA TCATTTATGG 
CAGATACAAA GCTAGTATCG GAAGAATCAT ACGTGACTTA TGTGAGCGTA AGGGTGTAAT 
AATCCATGAA GCGAATGCTT GTTCAGACCA TATTCACATG CTTATCAGTA TTCCTCCGAA 
ACTTAGTGTT 

(2) INFORMATION FOR SEQ ID NO: 233: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4766 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 233: 
GAACTATATT GCATATATTT CTAGCAATGA TCATGGCGAA TCTTGGTCTG CACCAACTTT 
ATTACCTCCT ATAATGGGAC TTAATCGGAA TGCGCCATAT TTAGGTCCTG GACGTGGAAT 
CATTGAAAGC TCAACTGGAC GTATTCTTAT TCCGTCTTAC ACTGGTAAAG AGTCTGCGTT 
CATTTATAGT GACGATAATG GAGCATCTTG GAAAGTTAAA GTAGTGCCAC TTCCTTCTAG 
TTGGTCAGCA GAAGCACAAT TTGTAGAATT GAGTCCAGGA GTAATTCAAG CATATATGCG 
TACAAATAAT GGTAAAATTG CATATTTAAC AAGTAAAGAC GCAGGTACTA CTTGGAGTGC 
ACCGGAATAT TTGAAATTTG TTTCAAATCC AAGTTATGGA ACACAATTAT CAATCATCAA 
TTATAGCCAA TTGATTGATG GTAAAAAGGC TGTCATTTTA AGTACTCCAA ACTCCACAAA 480

TGGTCGTAAA CACGGACAAA TTTGGATTGG TCTAATTAAT GATGATAATA CAATTGATTG 540 

GCGTTATCAT CACGACGTTG ATTATAGTAA CTATGGATAC TCATATTCAA CATTGACAGA 600 

GTTACCAAAT CATGAAATTG GATTGATGTT TGAAAAATTT GATTCATGGT CTCGTAATGA 660 

ACTTCATATG AAAAATGTTG TACCATATAT AACATTTAAG ATTGAAGATC TGAAAAAGAA 720 

TTAAAGCTGA AATTTGAAAA TATATAAAAA GAGGATAAAA ATTATGGTAA ATTACGGTAT 780 

TGTTGGAGCT GGATATTTTG GAGCTGATTT AGCTCGCTCA ATGAACAAAA TTGAAGATGC 840 

AAAAGTGGTT GCGGTATTTG ACCCAAATCA TGGAGAAGAA GTTGCTCAAG AGTTGGGATC 900 

AGATGTTTGT GCAAGTTTAG ATGAACTTGT AGCACGTGAA GATATTGATT GTGTGATCGT 960 

AGCTTCACCT AGCTACCTTC ACCGTGAACC AGTTGTGAAA GCTGCTCAAC ATGGCAAACA 1020 

CGTATTTTGT GAAAAGCCAA TTGCATTGTC TTATGAAGAT TGTAAAGCCA TGGTTGACGC 1080 

ATGTAAAGAA AATAATGTCA TCTTTATGGC TGGTCACATC ATGAACTTCT TTAACGGTGT 1140 

ACACCATGCT AAAGAATTGA TTACTCAAGG TAAAATCGGT AAAGTTCTTT ATTGCCATGC 1200 

TGCTCGTACA GGTTGGGAAG AACAACAACC AACTGTATCA TGGAAGAAAC TTCGTTCTCA 1260 

ATCTGGAGGA CATTTGTACC ACCATATTCA TGAATTAGAT TGCATTCAGT TTATCATGGG 1320 

AGGACTTCCT GAAAAAGCGA CAATGGTAGG AGGCAATGTA TATCATAAAG GTGAAAACTT 1380 

TGGTGATGAA GATGATATGC TCATTGTAAA CTTAGAATAC TCTGATGATC GTTATGCTGT 1440 

TTTGGAATAT GGTAATGCTT TCCGTTGGGG TGAACACTAC GTCTTGATTC AAGGAACTGA 1500 

AGGAGCTATC AAACTTGACT TGTTCAATAC TGGCGGTACT CTTCGTGTTA AAGGTGAAGG 1560 

AGAATCACAC TTCTTAGTTC ATGAAACTCA AGAGGAAGAT GATGATCGTA CAGCTATCTA 1620 

TACCGGTCGT GGTATGGATG GAGCAATTGC GTACGGTAAA CCAGGAGTAC GTTGCCCATT 1680 

ATGGTTGCAA ACATGTATTG ATAAAGAAAT GGAATATCTA CATGACATCA TTAAAGGTGG 1740 

AGAAATTACA GAAGAATTTG AAAAACTTCT CAATGGTGTA GCTGCTTTAG AATCAATCGC 1800 

TACCGCTGAT GCATGTACTT TATCAGTTAA AGAAGATCGA AAAGTAAGTC TTTCAGAAAT 1860

CACAAATGCT TAACTTTTGT AAAACAGAAT AGTAAATTCT TGTCATTATA TAATTTCTAA 1920 

AGTTCTGTGA TACAACTCAT TGAATAAAGA AATAGAGATG GGACTGGGAT AATGCCCAGT 1980 

CCCATTTTTT ATCAAAAAGT AATGAGATCA AAAATGTGGG AGTGTTGAAA TGAAGATTAT 2040 

AGGTATCGAT ATTGGCGGAA CAACAATTAA GGCAGATTTA TACGATGAGT TTGGAACGAG 2100 

TTTGAATCAT TTCAAAGAGA TAGAAACAAT TATTGACTAT GATTTGGGAA CGAATCAGAT 2160 

ATTAAATCAG GTCTGTGATT TAATTGGTGA GTATACTTTA AATCATTCAA TTGATGGTGT 2220 
TGGGATTTCC ACTGCTGGAG TTGTTAATGC TAATACTGGA GAAATCATCT ATGCAGGCTA 2280

TACAATACCA GGGTATATCG GAGTAAACTT TACTGCCGAA ATAGAAAAAC GTTTTGGGTT 2340 

GTATACTTTT GTTGAAAATG ATGTTAATTG TGCTGCATTA GGTGAATTGT GGAAGGGACA 2400 

AGCCAAAGAT AAGAAAAATG TAGTAATGGT TACTATTGGA ACAGGTATAG GAGGCAGTAT 2460 

TATTGTCAAC GGACAAATTG TTAACGGATT TAACTATACT GCTGGTGAAG TAGGTTATAT 2520 

TCCTGTAGGT AATTCGGATT GGCAAAGTAA AGCCTCAACA ACCGCATTGA TTCATTTATA 2580 

TCAAAAAAAG AGCTTGAAAA CTAATCAAAC TGGACGTACT TTCTTCACTG ATTTAAGATC 2640 

TGGAGATAAA GTTGCTGAAG AAACTTTTGA AATTTTTGTA GAAAATCTAA CAAAAGGTTT 2700 

ATTAACGATT TCTTATCTAC TTAATCCAGA AATTCTCATA TTAGGAGGTG GGATTCTGGA 2760 

TAGTAAGGAT ATTTTGTTAC CTGAAATTCA AAGTTCTTTA GCTAAAAATG CAATGGATAA 2820 

TAGGTTTTTA CCTAAAAATC TTGTGGCAGC TACATTAGGA AATGAAGCTG GTCGTATAGG 2880 

AGCTGTAAAA AATTTCTTAG ATAGAATTTC TAATAAATAG TATGTAAGAT AAGGAGGTGT 2940 

CACAATGACT AACTCTGTAT TTTCGACAAT GCAAGATATT GAGAATGTTG CAACCGATAT 3000 

TATAAAATCA TATGATAATG AGATTTATAC TTATAAAGCT GTTTCCCAAG AAGAATTGGA 3060 

AAAACTAGAA AAAAGTTATG ATGAAAAAAG TCACGAAGAA TTAGTTTCAA TAGAAAGCAA 3120 

TTTAGAAATG AAACAACAGA ACCTTATTGA TGAGGTTAAT AAAACAATCA AGGAAAATGA 3180 

TGCAAATATT CAGTATATTT CATCAAGTAG GAGAGGAGAA TTTGTAGAAA AAATTATTGG 3240 

TAGGGTGGTA GAAAAATATG GCCATTAGTC AGATGAAAAG AATCTCTCTA CTATTTTCTA 3300 

AAAGTAGTCT TGATGATGTT TTAAAAACTA TTCAAGAACT AGAGTCAGTG CAGTTCCGTG 3360 

ATTTAAAGGT TCAGGATAAC TGGTCAGAAG CTCTAGAAAA AGATGAAGTT GTATTTCCAA 3420 

CTATTCAAAT TTTTCATACT TCTAATTCCA ATCATGGGGT TATTGAGGGA AATGATGCCT 3480 

TGACTTATTT GATGAATCAA CAACAACATT TAGAAGCAAC TGTAGAGAAA TTACAAGAAT 3540 

ACCTACCGAA AGAAAACACG TTTAAATTAT TGCAGCAACC TCCGATAACT ACCTCTTATG 3600 

AAGAATTAGA GAAATTTGGT AAAGCTAATG TTGCTGAGGG TGTTCTTAAA AAAGTGAATC 3660

ATCAAATTAA CAGAGTTCAT GAATTAGAAA GACACATTCA AAGTAATAAT GAGGAAATAG 3720 

AGCGATTAAT AAAGTGGGAA AAATTAGAAA TTGTTCCTGC GAATTTAGAA CAATTTTCTT 3780 

TCTGTAAAGG AAAAGTCGGA ACAATTCCAA GGACTGAAGA TAATCGCTTA TACAATAGTC 3840 

TTTTAGAAAA CAATATTGAA GTTCAAGAAA TATTTTCTAA TGATAGAGAG TACGGTGTTG 3900 

TTGTTTTCTA TCAGTCTAGT TACTCTATAG ATTTTGATGA ATACTTATTT GAACCATTTG 3960 
ATTATTCTAG AAAGGAATTA CCGAAGCAGC GAGTAGTAGA TTTAGATCAA GAAAACATGC
AGTTAATAAC TGAAAAAGAG AATATTATCG CATCGTTGCA AGATTCAAAG AAATATTTGA 
TAGATTTACA ATGGCAAATA GACTATATTT TATCTATCTA TGCTCGTCAA ATCTCTAAGA 
ATAACTTTTT GTGCACTCCG CATCTAGTTG CATTAGAAGG ATGGATAGAA GAAACTCGTA 
TTTTATATTT TATAAAAGTT ATGGATGAGC ATTTTGGACA TTCTATTTAT ATTTATGAAT 
CGGAAACATT GACGGATAAT CAAGATGAAA TACCTATCAA ATTAACGAAT CATTCTTTAA 
TTGAACCATT TGAATTATTG ACAGAAATGT ATGCTCTGCC CAAATATTAT GAGAAAGATC 
CTACACCTGT ATTAGCACCA TTTTACTTTA CATTTTTTGG AATGATGGTT GCTGATTTAG 
GCTATGGTTT ACTATTGTTT TTAGGAACAA TGTTAGCATT AAAAATTTTT CATCTACCTT 
CAGCAACTAA GAGATTTTTA AAATTCTTTA ATATATTAGG GGTAGCCGTT GCAATTTGGG 
GTGGAATCTA TGGCTCATTT TTTGGATATG AGTTGCCATT TCATCTGATA TCTACAACCT 
CTGATGTCAT GACTATATTA GTAGTGTCAG TTGTGTTTGG GTTTATTACA GTATTTGCAG 
GTTTGTTAGC TTCAGGACTA CAAAAAGTAA GAATGAATAA ATATGCAGAA GCATATAATT 
CAGGATTTGC GTGGTGTGTT ATTCTG 
(2) INFORMATION FOR SEQ ID NO: 234: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 2484 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 234:
CCTTTTAGAA AAAATTAAAG AATACGACAC CATTATCATT CATCGTCATA TGAAACCAGA 
CCCTGATGCC TTGGGAAGTC AGGTGGGATT GAAAGCCTTG CTGGAACATC ATTTCCCAGA 
AAAAACCATC AAAGCCGTCG GTTTTGATGA ACCAACTCTT ACTTGGATGG CTGAGATGGA 
TCTTGTTGAA GATAGAGCCT ACCAAGGCGC ACTTGTCATC GTCTGTGATA CAGCTAATAC 
TGCTCGTATC GATGATAAGC GCTATAGTCA AGGTGATTTT CTCATTAAGA TTGACCACCA 
TCCAAATGAT GATGTATACG GTGACCTGTC TTGGGTCGAT ACTAGTTCAA GTAGCGCTAg 
aGaTGATTAC CCTATTTGCC CAAACAACCC AACTAGCCTT GGCAGATCGC GATGCTGAGT 
TGCTCTTTGC AGGAATTGTC GGTGATACAG GTCGCTTCCT CTACCCTTCT ACCACTGCAC 
GGACTCTTCG CCTGGCTGCT TATTTGAGAG AACATAACTT TGACTTTGCG GCTCTCACTC 
GCAAAATGGA CACTATGAGC TACAAAATTG CTAAACTGCA AGGCTACATC TACGACCATC 
TGGAAGTGGA
ACAATATAAC 
TGAGTCTCTG 
GTAAAGTCCA 
CAAGTGGTGC 
ACTTGCTTAA 
GTAACAATCT 
AGATATCCAT 
CCTTAGCGGT 
ATTGATCCGT 
CACTCAAGCA 
AGAGAACAGT 
AGACTCATCT 
ACGTCCATAA 
ATTCATTTTC 
TAAGGTTTCA 
CGATTACCGA 
TTCATGACAT 
TTCCACACTT 
TATTAGTCGG 
AAAGGATAAG 
CCTGAAATAA 
CAAGAAAGGC 
CTTGAAGGAG 
TTTGATTACA 
TAGTAGATTG 
GTCCTGTTCT 
TACGAGAGTC 
TACTATTATA 



TGAAAATGGT 
CGATGCTGAA 
GGGAATTTTT 
TCCTATCAAT 
TAATTCCTAT 
AAACTGATAA 
ATGGCTCGCA 
CCAGAATATC 
TCAACAAAAC 
GTGGAAATTT 
GATGGACGCG 
TTTGGCTGTT 
GTAGGTTCGA 
TGAGCTATAC 
TTAAAAGCTT 
AAACGAGGAC 
TGGACTTTAT 
CTTCCAAAGT 
GTTCAATGGG 
AATATTTAAG 
CTTGTGAAAG 
GGCATCAATT 
TTTCTCTTCC 
AGAGTCGAGT 
GTCTAAAAGC 
AAACTAGAAT 
TGTTTCATTT 
TTTTAAAAGT 
TAATGCTTTT 



GCTGCTCGCG 
ACTGCGGCCA 
GTCGAACAGG 
GAAATTGCCA 
AGCCTAGAAG 
AATACTTGCC 
AAGAGACCAT 
GCCCAGTTGT 
GCTCTAACGA 
CATCAGACTC 
TGGATCGTTT 
CTTTTTTGTT 
TTTCCATGCT 
TATTGTCACG 
CTCGGACTTT 
TTATACTCAT 
CACCTCCTTC 
TCgAAAGGCT 
TTCATCTCTG 
GTACTTGATT 
CTCTTCTAAA 
GTAACAAATT 
TCAACTGTCA 
CCGAACTCCT 
TCTATAATCT 
AGTACACCTC 
TACTATAGAA 
GTTTTGATGG 
TGATTTTAGT 



TTATCCTGAG 
TTGTAGGTGC 
CTGATGGCCA 
AGGAGCATGA 
AAAACGAAAT 
AAACTTTTCA 
GGCAGAAAGG 
CTTCATGGAC 
AACAGTTGAG 
ACACCCATTC 
CAACAAAAAA 
TCTTGAAATC 
ACTAGGCAGG 
AACCACACTT 
TTCCAAATCT 
CTGCTTTCAA 
TCCAGTCCTT 
TTATTCTTAA 
GTGTGTATGG 
TATGCCATAT 
AAGGCGTTCA 
CTCCTGCCTC 
TATATGCATG 
CATATTTTTT 
CTTTATAAGA 
TACTTCTAAA 
CGATTTGAAG 
TTTGGATTTC 
CTGGTATAAA 



TCAGAAAATC 
ACCTGGACGC 
CTACCGAGTT 
TGGTGGAGGC 
CATCTACCAA 
GAATCTGATA 
AAATATTGCA 
ACAACTACTG 
TTCGAAGGCG 
TACACTGGAC 
TACGGTCTCA 
AACTGCTGTT 
AAGGAAATAG 
TCATTGATGG 
TTGGAGGCAA 
AAAGCATTCT 
GTATGACATC 
ATCCACGTTT 
AGGAATAAAG 
AGCATTGTCC 
TCCACACTCC 
TGTAGCCTTC 
GTTACGACCA 
TACGTTTCGC 
TTTGCCCATC 
ACATTGTTAG 
GCGTTTATAA 
TTCTTTAGTT 
TATTGCTTTC 



TTGAAACAAT 
ATTGACAGAG 
CGCTTACGCA 
CACCCTCTAG 
AAGTTAGAAG 
GACTAGTATA 
AAATGAAAAr 
GTTACCArTT 
AAACTTACCC 
GTCAAAAGTT 
AATAATGATA 
TTCATGTTCC 
CTGTTTCAAC 
TCCAAGTGGA 
TGGCCTGCTC 
AGTCCATCTC 
TTGAAGTTGA 
ACGAATCTCT 
GTAAAATCAA 
ATAACGAGTA 
TTTTTATAAA 
AAATGACGGG 
CCACGTGTTT 
CAAATCGTTG 
AGACGAAATA 
AAATCGATTT 
TATTTAGCTG 
GATTTCATAT 
CTCCAAAATG 
GTCATAGTTT TACTGGCAAA TCTAACATAT CACGGATAAA TTAACAAGTG ATTTCTGAAT
TGCTAAACAT TTTCTTTTCT TATAGCATAC TTTAAGATTT TGTCTTTGAG AAAGATATTT 
CCAAGAAAAA CGTTCGTTTT TTGG 
(2) INFORMATION FOR SEQ ID NO: 235: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1756 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



180 
240 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 235: 

CTAGATATAG CTATAATTTT ATTTATAACA AGAGGATAGA AATGACCGAA TTAGAAAGAA 60 

AAAATCGAAA AATTAGCTAA GAAATATTCT GATAACTTAA ACATCAAAGT TCAAGAGAGA 120 
GTTCGTGAAA TGGCAAATGA TAATAAGAGC CATTATTTGA TATACAGAGT TTTAGGTATT 
TCATTTGAAG AAGGAGAAAA TATCGATTTG TATCAAAATA AAGGTCGTTT TTTATACAAA 

TATGCTGGTT CATTTTTAGA AGAAGCTGCA GTACTATGCT TTAACGAAAA ATTTGGTACA 300 

GAAAATACTT AAAAAGTTAA CATTCCTAAT TCTGAAAGTA CAAAACCTAA GACTTTTGAA 360 

ATTGATTGTT TAGTCGGAGA AAAACACGCA TACGAAATAA AATGGTGGGA TGCAACTACA 420 

GATGGAGACC ATATAACTAA AGAACACACT AGAATAAAAG TTATTCATAA CAAAGGATAT 480 

ATACCAATTC GGTTAATGTT CTACTATCCA AATAGAACTC AAGCTATAAA AATTCAGCAA 540 

ACTTTAGAAA CATTGTATAA CGGTATTGGA GGGAAATATT ATTATGGAGA TTCTGCCTGG 600 

GAACATTTAA GAGCAGTGAC CGGTATTGAT TTACTTAGTA TTCTAACAGA TATTGCAAAT 660 

AAAAAAACAG GGGTAAAATC AAAATGACAG TATTAAAAGG AGATAACTTA GAAATATTAA 720 

AAACTATTGA ATCCTCAAGT ATTGATTTAA TCTATATGGA CCCTCCTTTC TTTACACAGA 780 

AAACCCAAAA ATTATCTAAT AACAAAAATA TTATGTATTC ATTCGAAGAT ACGTGGACkT 840 

CGATTGAGGA TTACAAAGAA TTTTTGTCTG TAAGATTAGA AGAATGCAAA AGAGTGCTAA 900 

AAAATAGTGG CAGTATTTTC GTTCATTGTG ATAAAATTGC AAATCATCAT ATTAGATTAA 960 

TTTTAGATAA TATCTTTGGA GTAGATATGT TTCAAAGCGA AATTATATGG AACTATAAAC 1020 

GGTGGTCTAA TTCAAAAAAG GGATTATTGA ACAATCATCA AAACATTTAC TTTTATTCAA 1080 

AGTCAAAAGA TTTTAAATTT AATACAATTT TTACAGAGTA TTCTTCTACT ACAAATATCG 1140 

ACCAAATACT AGTGGAACGA AAACGAGATG GAAACTCTAA AACTATATAT AAGGTTGATA 1200 

ATAATGGTAA CTATATTCTA GCAAAAGAGA AAAATGGAGT TCCCCTTTCA GATGTTTGGA 1260 
ATATACCATT TCTTAATCCA AAAGCTAAAG AAAGAGTAGG TTATCCTACA CAAAAACCTA
TTCTGTTATT AGAACAAATT ATAAAGATTG CTACTGATAA AAATGATATA GTTTTAGACC 
CGTTCTGTGG AAGTGGAACT ACTTTAGTAG CCTCCAAGAT TTTGAATAGA AATTATATGG 
GGATTGATTT ATCTGAGGAA GCTATCAATA TAACTCAGCA ACGTCTGGAA AATGTTATAA 
AAACAAGTTC AAATTTATTG AATAAAGGAA TCGAAGCATA TAGAACCAAA ACTGAGGAAG 
AGGAAAACAT TCTTAAATTA TTACAGGCAA AAATTGTTCA AAGAAATAAA GGAATTGATG 
GTTTTTTACC TAAACATTTT CAAAAAAAAC CGATACCTAT AAAAATTCAA AAAAATAATG 
AATGTCTGAA TGAGAGTATC TCTTTATTAC AGAATGCTAT AAACTCCAAA AAACTTGATT 
TTGGAGTAGT TATAAAAACT CATTCG 
(2) INFORMATION FOR SEQ ID NO: 236: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 748 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 236:

CCGAAAATCA AATTCAAACC ACGTCAACGT CGCCTTGCCG TACTCAAGTA CAGCCTGCGG 60 

CTAGTTTCCT AGTTTGCTCT TTGATTTTCA TTGAGTATTA AACTAAATTA AATAATATTA 120 

GCGCGGAGAA TTTCTAATTC TTCCTTGGTC AAGCGACGCC ATTCCCCTCG TTCTAGGTTC 180 

TCATCTAATA CTAAAGTTCC CATAGTCAAT CGTTGCAAGT CCACCACTTC CTTGCCACAG 240 

TAGCCCACCA TACGCTTGAT CTGATGAAAC TTCCCTTCTG CAATGGTCAC ACGGATTTGG 300 

CTTTGATTCT TTTCTGTATC TATGGATACA AGCTCCAGTA TAGCGGGTTG ACAGGTAAAG 360 

TCTTTGAGAG GAATACCCTC AGCAAATGTC TCCACATCTT CTTGGGTCAT GATTCCCTTG 420 

ACTTGTGCCA GATAAGTCTT GTCCACATGA CGCTTGGGCG AAAGAAGAAC ATGAGCCAGC 480 

TGACCATCAT TGGTCAAGAG CAAAAGACCA TGCGTGTCAA TATCCAAGCG TCCTACTGGG 540 

AAAACTTCCT TACTCCGCGC CAAGTCATCC AACAAGTCCA GAACGGTTCT GTGCTTGGGA 600 

TCCTCAGTCG CTGAGATAAC TCCTTTGGGC TTGTTCATCA TGTAGTAGAC AAACTCTTCA 660 

TACTCCAACA CTTGCCCATC AAAGCGAATC TCATCTATTT TTTCATCAAT CTGCAATTTA 720 

GCTGATTTTT CTTTTTGACC ATTTACAG 748 
(2) INFORMATION FOR SEQ ID NO: 237: 
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1449 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 237: 

AAAAGATTAC ATTGCAACAA TTGAAAATTA TCCAAAGGAA GGCATTACCT TCCGTGATAT 60 

TAGTCCTTTG ATGGCTGATG GAAATGCTTA TAGCTACGCT GTTCGTGAAA TCGTTCAGTA 120 

TGCTACTGAC AAGAAAGTCG ACATGATCGT GGGACCTGAA GCTCGTGGAT TTATCGTGGG 180 

TTGTCCAGTT GCCTTTGAGT TGGGAATTGG TTTTGCGCCT GTTCGTAAGC CAGGTAAATT 240 

GCCACGCGAA GTTATTTCTG CTGACTATGA AAAAGAGTAC GGTGTCGATA CCTTGACTAT 300 

GCACGCGGAT GCCATTAAGC CAGGTCAACG TGTTCTTATT GTAGATGACC TTTTGGCGAC 360 

AGGTGGAACT GTTAAGGCAA CTATCGAGAT GATTGAAAAA CTTGGTGGTG TTATGGCAGG 420 

TTGTGCCTTC CTTGTTGAAT TGGATGAATT GAACGGCCGT GAAAAAATTG GTGACTACGA 480 

CTACAAAGTT CTTATGCATT ATTAATGAAA ACAGTCCCTA GGGCTGTTTT CTCTACACTA 540 

GGATATAAAA ATAGACTATA ACTAGTTAGA GAAAAACTAT AATTGAAAAC TATATCTTCT 600 

TGCAGTATAA TAAAAGGACT AAGTGTTTGA GATTTGTCTT CAAACATATG CAATTATTCC 660 

TGAAAGAGTA CAGTTAGGAG AGGGTTATGC CGATTCGAAT TGATAAAAAA TTGCCAGCTG 720 

TTGAGATTTT ACGGACAGAG AATATCTTTG TCATGGATGA TCAACGTGCT GCCCACCAAG 780 

ATATCCGTCC TTTGAAGATT TTAATTTTAA ATCTCATGCC ACAGAAAATG GTCACAGAGA 840 

CCCAGTTGTT GCGCCACTTG GCTAATACAC CCCTACAACT GGATATTGAT TTTCTCTATA 900 

TGGAGAGCCA CCGTTCTAAA ACAACTCGTT CAGAGCACAT GGAGACCTTC TATAAAACTT 960 

TTCCTGAAGT CAAGGATGAG TATTTTGATG GGATGATCAT CACGGGTGCT CCAGTTGAGC 1020 

ATTTACCATT TGAGGAAGTG GACTATTGGG AGGAATTTAG ACAGATGCTT GAGTGGTCTA 1080 

AGACTCATGT CTATTCGACC CTTCATATCT GTTGGGGGGC TCAGGCTGGG CTTTATCTGC 1140 

GCTATGGTGT AGAAAAATAC CAGATGGACA GTAAGCTATC AGGTATTTAT CCTCAGGACA 1200 

CCCTAAAAGA GGGTCACCTT CTATTTAGAG GCTTTGATGA TAGCTATGTA TCCCCTCATT 1260 

CACGGCACAC GGAGATTTCT AAGGAAGAGG TCTTAAACAA GACCAATCTC GAGATTTTAT 1320 

CAGAAGGACC TCAGGTTGGG GTTTCTATTw TGGCCAGTCG TGATTTACGA GAAATTTATA 1380 

GTTTTGGTCA TTTGGAGTAT GACCGTGATA CTTTGGCAAA AGAGTATTTT CGAGATCGTG 1440 

ATGCAGGTT 1449
(2) INFORMATION FOR SEQ ID NO: 238:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 904 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 238:

TACCCGCTTC TTTCAAGAGT TGGAGCAGGG CTTGTTTGCG ATCTTTTGTC ATAGTTCTTC 60 

CTTTTAACGG CGTTTTCGAA GCACTTTATA GACAGCTAGT GCTAATGTAT AGTCTACCAT 120 

ACTATGGATA ATTGTACCAA ATCCAACTAG TACAAATAGA ACATAAAACA TATTTTCTAC 180 

ATTGGTACCA GAAGTTGCGT AAAAAACGAC ACAGGCCAAT ACTTCAGCAA GGGCATGAAC 240 

AACAGCCAAA ACAAAGTTGA AAATCCAGGA AGATTTTGGT TTATCTAGGG TATCGGGGAA 300 

TTTTTGTAGG TAAAGAGCTC CTAAAGCACC AAAAGATATA TGGGAAAAAG CCCGAAAAAC 360 

GATAACCATG GGATAGCCAG CCATCAAAAA TCCAAAACTA GAGGCTAGGA TGACAAAAAC 420 

TGCCATCAAG GGCGACAAGA ACATGGCTAT AAAAATAGCG ATGTGGCTCC CCAAAGTATA 480

GGAAGCAGGT GGAATGACAA TCTTGAAAGG CATAACAATT GGAATCAAAA TCGCAATAGC 540 

CGTTAAAAGG GCTGTCATTG TCATAAATTG TGTCTTTTTC CGTGTATTCA CAAGAATCTC 600 

CTTTTTAACT GCATATACAC TAGTATGGTA CAATAAACCA GACAATAAAG CAAGAATTTA 660 

CTTGGGTTTA TAGATCATTT TTTAGTTAAA AGTTATAGTA GATTGAAACT AGAATAGTCC 720 

ACCTCTACTT CTAAAACATT GTTAGAAATC GATTTGGCTG TCCTGATCGA TTTGTCCTGT 780 

TCTTATTTCG TTTTACTATA GTAAAGATTT CATTAAAAAG AAACTGTATA GAGCAAAATC 840 

TCCACCTTCA GGTTTGGAAA GCGGAGATTG TTTnTTATTT TTTCCAGGGT TTGTAGTCGT 900 

GGGA 904
(2) INFORMATION FOR SEQ ID NO: 239: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 946 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 239: 

CACTCAAACA TGACTTATAT CAAGACGGAT GGACTTCAAG ACGATGCCAA TCGCTTGAAT 60 
CGTAACATTC AGTTTGGTGT TCGTGAATTT GCAATGGGAA CAATCTTGAA CGGGATGGCC 120

CTTCATGGTG GACtTCGTGT ATACGGTGGA ACTTTCTTCG TCTTCTCTGA CTATGTGAAG 180 

GCAGCTGTCC GCTTGTCAGC CTTACAAGGA CTTCCTGTGA CTTATGTCTT TACCCATGAT 240 

TCAATCGCAG TTGGGGAAGA TGGTCCGACT CATGAACCAG TTGAGCATTT AGCAGGTCTT 300 

CGTGCTATGC CAAATCTAAA TGTTTTCCGT CCAGCAGATG CGCGTGAAAC GCAAGCAGCT 360 

TGGTACCTTG CAGTGACAAG TGAGAAAACA CCAACTGCCC TTGTCTTGAC ACGTCAAAAT 420 

TTGACTGTTG AAGATGGAAC AGACTTCGAC AAGGTTGCTA AAGGTGCTTA TGTTGTATAT 480 

GAAAATGCAG CCGACTTTGA TACCATCTTG ATTGCGACAG GTTCAGAGGT TAATCTTGCT 540 

GTCTCAGCTG CTAAAGAATT GGCTAGTCAA GGCGAAAAAA TCCGCGTAGT CAGCATGCCA 600 

TCTACAGATG TCTTTGATAA ACAAGATGCA GCTTACAAGG AAGAAATTCT TCCAAATGCA 660 

GTCCGCCGTC GTGTTGCAGT CGAAATGGGT GCAAGTCAAA ACTGGTACAA ATATGTTGGT 720 

CTCGATGGTG CCGTTCTAGG TATTGATACT TCGGAGCCTC TGCCCCAGCA CCAAAAGTAT 780

TGGCAGAATA TGGCTTTACT GTAGAAAATC TTGTAAAAGT TGTTCGAAAC TTGAAATAAT 840 

CCTAAAAATC AGGGCGTAAG CTCTGGTTTT TCTTACCAGA AAAGTAAGGT ACAATCTTGT 900 

AAAAGTAGCT GAAATTTGAT ATAGTAGTCC TATGTAAAAG ACAAAG 946
(2) INFORMATION FOR SEQ ID NO: 240: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 2764 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 240: 

CGGGGCTCCc TAGTTCTTAG GGAGCTATTT TTGTTTTTTC AAGAAGTTAT CTTCTTGTAT 60 

TTTATACTCA ATGAAAATCA AAGAGCAAGC TAGGAAACTA GCCGTAssTG CTCAAAACAC 120 

TGTTTTGAGG TTGTAGATAA GACTGACAAA GTCAGGAACA CATATCTACG GCAAGGCGAC 180 

GTTGACGCGG TTTGAAGAGA TTTTCGAAGA GTATTAGTTG TGAATCTGGT GCAGTCGTCC 240 

CAGATTATTC TTATTAGTAG GGTCTTGTTT TCTATATCCC CTCGTAGTTA ACAAGACCTT 300 

GAGCATTTTA GAAAGAGGAA TCTATGTCTA CGAAATATAT TTTTGTAACT GGTGGTGTGG 360 

TATCGTCCAT TGGGAAAGGG ATTGTGGCAG CGAGTCTAGG CCGTCTCTTG AAAAATCGTG 420 

GTCTCAAAGT AACCATTCAA AAGTTTGACC CTTATATCAA TATTGATCCG GGAACCATGA 480 

GTCCTTACCA GCACGGGGAA GTTTTTGTGA CAGATGACGG AGCTGAGACA GATTTGGACT 540 
TGGGTCACTA TGAACGTTTC ATCGATATCA ATCTCAACAA ATATTCCAAC GTGACAACTG 600

GGAAAATTTA CAGTGAAGTT CTTCGTAAAG AACGCCGTGG AGAATACCTT GGGGCAACTG 660

TTCAAGTCAT TCCTCATATC ACAGATGCTT TGAAAGAAAA AATCAAGCGT GCCGCTCTAA 720

CGACCGACTC TGATGTCATT ATCACAGAGG TTGGTGGAAC AGTAGGAGAT ATCGAGTCCT 780

TGCCATTCCT AGAGGCTCTT CGTCAGATGA AGGCAGATGT GGGTGCGGAT AATGTCATGT 840

ATATCCATAC AACCTTGCTT CCTTACCTCA AGGCTGCTGG TGAAATGAAA ACCAAACCAA 900

CCCAACACTC TGTCAAAGAA TTGCGTGGCT TGGGAATCCA ACCAAATATG TTGGTTATTC 960

GTACAGAAGA GCCAGCTGGT CAAGGAATTA AAAATAAACT GGCCCAGTTC TGTGATGTGG 1020

CACCAGAAGC CGTTATCGAA TCGTTGGATG TTGAACACCT TTACCAAATT CCACTGAACT 1080

TGCAGGCACA AGGGATGGAC CAAATTGTTT GTGATCATTT GAAATTAGAC GCACCAGCAG 1140

CGGATATGAC AGAATGGTCA GCCATGGTGG ACAAGGTCAT GAACCTCAAG AAACAAGTTA 1200

AGATTTCCCT TGTTGGTAAG TATGTGGAGT TGCAAGATGC CTATATCTCA GTGGTCGAAG 1260

CCTTGAAACA CTCTGGCTAT GTCAATGATG CAGAAGTTAA AATCAATTGG GTCAATGCCA 1320

ATGATGTGAC AGCAGAGAAT GTAGCAGAAC TCTTGTCTGA TGCGGACGGG ATCATCGTAC 1380

CAGGTGGTTT TGGTCAACGT GGTACAGAAG GGAAAATCCA AGCCATCCGC TATGCGCGTG 1440

AAAATGATGT TCCAATGTTG GGAGTCTGCT TGGGAATGCA GTTGACATGT ATCGAGTTTG 1500

CTCGTCACGT TTTAGGTCTT GAAGGTGCCA ATTCTGCAGA GCTTGCACCA GAAACAAAAT 1560

ACCCTATCAT TGATATCATG CGTGATCAGA TTGATATTGA GGATATGGGT GGAACCcTTC 1620

GTTTGGGACT TTATCCGTCT AAGTTGAAAC GTGGCTCTAA GGCTGCTGCT GCTTATCACA 1680

ATCAAGAAGT GGTGCAACGC CGTCACCGTC ACCGTTATGA GTTTAATAAT GCCTTCCGTG 1740

AGCAGTTTGA GGCAGCAGGT TTTGTCTTTT CAGGAGTTTC TCCAGACAAT CGTTTGGTAG 1800

AAATCGTGGA AATTCCTGAA AATAAATTCT TTGTAGCTTG TCAGTATCAC CCTGAACTGT 1860

CAAGCCGTCC AAACCGACCA GAAGAACTCT ACACTGCCTT TGTTACTGCA GCAGTTGAGA 1920

ACAGCAATTA GCAAAATCAG AACCTTTGAG AAAAATCTCA GAGGTTTTTT GCATACGATG 1980

ATATTGCAGT ATATCTGAGG TAGGGGTCCT CTGTATGTAC CTGCTACCGT TGAAATCAAT 2040

AGCGACTCCC TCTTGCCCTG TGCTAGTGAA TGGATTTATC AGTATATTGA AATGAAATAA 2100

AATTTGAACA AATTAATTCG GAAAGCCAAA TCAATTTCTA GCAAAGTTTT AGGAACTGGA 2160

TTGTATAGTG AATTGAAATA AGATGTGAAC ATCTCTATCA GGAAAGTCAA ATTAATTTAT 2220

AGAAATATTT TAGCAGTCAA GATGTACTGT TATAGATTCA ATACATTATA CTTTTTTAAT 2280

TTAATCCACT ATAGTAAAAT GAAATAATAA CAGGACAAAT CGATCAGGAC AGTCAAATCG 2340

ATTTCTAACA ATGTTTTAGA AATAGAGGTG TACTATTCTA GTTTCAATAT ACTATCCCAA 2400 

ATCATTCATA CCTCTCTCAA CTAGATGTAA CTTACAAAAC CCCTGACCTC ATGAGCCACT 2460

TTCTTCCTCC TCATGAGGTC AGTTTTACTT TCTGCTGTTC CAGTATCGTT TTTCCTCGCT 2520

AGATTTCCTC AAAAGGGCAG ACTCCTCCCT TGGTGCGTCA CACGATTTTT TCATCTCGAC 2580 

TGTTCTTTAA TGCATCATTA ACGACGCTTT TCTTCTAGGT GGTTCATAAG GAACAGGAAG 2640 

ATTCAGGTTG ACTTTTCTAA TCCTAGAATA AAGTGCTGAA AACAATTCGG AATAGGCATA 2700 

GAGACTAGAC AATTTGAGGA GCTGCTTGCG TCCTGTTCGA ACACATTTTC CCACCACGTG 2760 
AAGA 2764
(2) INFORMATION FOR SEQ ID NO: 241:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1682 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 241: 



CCGTTTTTTT CATTGTTCAG TACTACAACT TACGTTGTAG CGCCCTGCAC ATTGGTTCGT 60

CTTGTTCAGT TTTCAAAGGT CTTTGTCACT TGCTTCTCTC AAGCGACAAC TATATTAGTA 120

TATCACAACT GCTTTCGCTT GTCAACACTT TTTTGAAGAT TTTTAAGTTT TTTTAAACTT 180

TTTTTCATCA AGTGGTCCTG ACGCAACATA CCATAGTCCG TACGGGATTC GAACCCGTGT 240

TACCGCCGTG AAAAGGCGGT GTCTTAACCC CTTGACCAAC GGACCTGAGT TGTTATTTTC 300

AACTCTTACT ATTATACAGT CTTTTCAAAC TTTGTCAACT ACTTTTTTAA ACTTTTTTTA 360

TTAATTTTAC AACAGCTTCA GTTCGAGCTG TATGTGGGAA CATATCGACC GACTGGATAT 420

AATGAAGATC ATAGACTTCT ACTAAGCGTA CCAAATCACG AGCCAAGGTC GAAACATTAC 480

AAGAAATATA AACCATTTTT TCTGGTACAT AAGTAAGAAT AGTATCTAAT AACTTATCAT 540

CCAGACCTGT ACGTGGTGGG TCAACAATCA AAGCATCTGC TCGGTAGCCT TCCTTGTACC 600

AACGAGGAAT AATCTCTTCT GCCGTTCCAG CTTCATAATG AGTATTGTCA AATCCCATTC 660

TTTTAGCATT TCGCTTGGCA TCTTCAATAG CTTCTGGAAT AATATCCATA CCTCTGAGTG 720

TTTTTACTTT CTTTGCAAAG GCAAATCCAA TCGTTCCAAC TCCACAATAA GCGTCAATCA 780

AATGGTCTTC TTTATCAACA TCCAGCGCTT TTACTGCTTC GCTATAGAGG ACTTCTGTTT 840

GCTCAGGATT TAGTTGATAA AAAGCTCGAG GGGATAGTGA AAATTCATAA TTGAGTACAC 900

CTTCTTGAAT ACTCTCTTGC CCCCAGATAA TCTCTGTCTT TTCACCATAT ATCTCACTGG 960

TTTTAGCTGT ATTTGTATTA ACAGCTACTG TCACAACTTC TGGGAAATCT TTAACCAACT 1020 

CTTTTACCAA TTGAGTTAAA TTAAGCTGGC GGTTTGTAAC AATAATAATC TGAACCTGTC 1080 

CGGTCTTTCT CGCGCGTCGG ACCATAATAG TACGGACACC TAGAACTTTT CTCTCATCCG 1140 

TGATTGGAAT CTGGTGATAA GTAAGTAATT CTGCTAAGCG ATTAGCAATC ACTTGGGTTT 1200 

CCTTATCTTG TACCAGGCAG TCTTTCAACT CTACTAAATA GTGAGAGTTT TGTGCATATA 1260 

AGCCCGCCTT GACCTGATTT TTAAATTTTC GAGTCTGAAA TTGTAACTTA GCTCTGTAAT 1320 

ATTTTGGTTC CTGCATTCCA ATAGTTGGAC GAATTTCATA ATTTTCATAT CCTGCAGGAG 1380 

CAAATTTTTT CAGCGCTTGA TGAAGTAAGT CCGTCTTGAA CTCCAGCTGC TTATCATAAT 1440 

GCAGGTGCAT GATTTGGCAG CCTCCGCATT CATTATAAAT AGTACAAGAT GGCACAATTC 1500 

GAAATTTAGA CTTCTTGTTG ACCTTCAGTA ATTTTGCTTC AACAAAGTTG CGTCTAATAG 1560 

AAGTAATCTG ACAATAGATA TCTTCGCCTT TGAGAGCTCC TGGTACAAAG ACTAATGTTT 1620 

TTTGGTAAAA GCCGATTCCC TCACCGTTAA TTCCCATGCG CTTGATTTTT AATGGTATTT 1680 

TT 1682 
(2) INFORMATION FOR SEQ ID NO: 242: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 2524 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 242: 

TTAACTTTGG TCAATTCTTT AAAGTCATCC TCTGTAAGCA TGTCTAACCA TTGATGTTTC 60 

CCTTTATTGC TAAAATCACC AATTCCGACT ACAGCTATAT CTAAATCTTT CCAACTATTT 120 

TTCAAATTTT CAAAATATCT TGATTGCAAA ATACCATCTG CTAACAATTT ATTTTCTTGC 180 

ACAATCGTTG CATTCATAAA TGTACACTCT CCATGAAATT TTCTAGACAT TTCATAAATC 240 

AGTGTATTCA CATGGTATTT AGCGTGTATG TGACTAGGAC CACCTGCTAG AGGATAGAAG 300 

TGAACATTTC GGACACTTTT ACTGTGAATT AAATCTACTA AATTACTTAA ACTTTTCCCC 360 

CAAGAAAAGC CAATTTTCAT ATTATCATCA ATTAGATTCC TAAGGACGCC TGCTGCAACT 420 

TGAGAAATTC TTTCAGATAA AATTGTTGGA GTATCATCAA ATTCATTTGG AATAATTTCT 480 

AAACTTTCCA AACTGTATTT TTCTTTTACA TAATTTTCCA ACTTAAACAT ATTGGTATCA 540 
AAATTCTCTA TTTCAATTTT AACAATTCCT ACATTCCTTG CTTCTGTTAA CATTCTACTA 600

ATAGAGGTTC TATAAATTCC TAATTTTGCT GCTATTTGTG ACTGATTTAA GTTTTCAATA 660 

TAATACAGAT AAGCAATTTT AGAAAGCAGT TTATTCCTAT CTTGATTCAT ACACTTAACC 720 

TCTTACGAAA CTACCTTAAC CATTATCCCA GCATTTTCTA ATGTAGCTAT ATTTTGTTTA 780 

GAAAGTTTTT CGTCTGTTAT TACTTCATAG ACTTGACTTA AAGCAAATCT TCTTACTGTA 840 

CCTCTTTTAT CAAATTTACT TGAGTCAGTT AGGACAATGA CTTTATCCGA CACTGCTGAA 900 

ATATATTGAA CTACCTCACT GCGCATTAAA TCTTTTCCGG TAAAGCCCAT CTCTTTATCG 960 

TAACCATCTG TCCCAACAAA AGCTTGACAC ACATGAAAAG TCTGTATCAT TTCTTTTAAT 1020 

AAAGGTCCTA CAGTCACCTG TGAATCTTTC TGAAACTCAC CACCAAGAAC AATAACACGA 1080 

CATGAATCAT AAGCTCTCAC AAAATTTGCT ATAAAAAACG AATTTGTTAC AATCGTAACA 1140 

TTTCTTTTTT GCTTGCAAAT TTCCTCAGCA AGTAAAGCAC AGGTCGATCC AGATTCTATC 1200 

ATTATTGTTT CATTATCTGA CACCAATTTT ACTGCTTCCT GAACAATTTT TCTCTTAGTT 1260

TCATAATTAA TTGACAAACG TACATTTAAG TCATCTCCAC TATTTAATAC AGCATATCCA 1320 

TGCTCTCTGT GTAATAAACC TTTTGACTCT AATTTATCTA AATCTTTTCT AATCGTTACT 1380

TTCGATACAT TTAATTTTTC CGATAATGTA TTAACGTCGA TCTTTTCATA TTCTGATACT 1440 

AATTTAATAA TTTGTTCCAA TCTTTTCATT TTACACCTCC GTTTTATTCT ACCAAAATAA 1500 

AAAGCAAAAA ACAACAAATT AACCTTTCGT TCGTAATTGT TTTTCTTTCG TTTTTGTGAT 1560 

AGGATAGACT TATGAAGAGG AGGAACTCTT ATGGAAATAT CTAAAGGAAT TATTTTTAAT 1620 

ATTCAACACT TTTCAATTCA TGACGGTCCG GGTATTCGTA CAACTGTTTT TTTAAAAGGA 1680 

TGTCCTCTGC GCTGTCCATG GTGTTCTAAT CCTGAATCTC AAAGAATGAA ACCTGAAAAA 1740 

ATGAAAGATG CTCAACGAGA GAAATTCACC TTAGTCGGTG AAGAAAAGAC TGTAGAAGAA 1800 

ATTATTACAG AGGTATTAAA AGACAAAGAA TTTTACGAAG AATCCGGTGG AGGTTTAACT 1860

TTATCAGGAG GTGAAATATT TGCTCAGTTT GAATTTGCTA AAGCCATCTT AAAATCAGCT 1920 

AAAGAACATC ACATACACAC TGCCATTGAA ACTACTGCCT TTGTTGATCA TGAAAAATTT 1980 

ATTGATTTAA TTCAATATGT GGATTTTATC TACACAGACC TAAAACATTA TAATTCTATA 2040 

AAACATAAAA AAGTGACTGG GGTTTTTAAT CAAATGATTA TTAAAAACAT TCATTATGCT 2100 

TTTTCACAAA ATAAAACTAT CGTTTTAAGA ATCCCAGTTA TTCCTAATTT TAACAATAGT 2160 

TTAGAGGATG CAGAAAAATT CGCTACTCTA TTTAACTCAT TAAATATCGA CCAAGTTCAA 2220 

CTACTCCCTT TTCATCAATT TGGTGAAAAC AAATATCGTT TATTAAATCG GAAATATGAA 2280 

ATGGATGGAA TCAACGCACT TCATCCwGAA GATCTTATTG ATTATCAAAA GGTATTTCTG 2340 
AACCACCATA TTAATTGTTA TTTCTAGTTT ATTTCCTTGA AATGCTCTAG CTATTTGCAG 2400

ATAACAAGCA TCTATAATAC ATACTTAACT TTTCAAAAGG TTTAGCTAAA AAATTTTAGC 2460

CAAACCTTTT CTATTTTACC TTGCTCTAGA ATTTTTAAAC TGCTATACTT ATCACAAAAA 2520

AACG 2524
(2) INFORMATION FOR SEQ ID NO: 243:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 2359 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 243: 

CGTGCTTGGG GGCTTGTGGT CAAAAGGAAA GTCAGACAGG AAAGGGGATG AAAATTGTGA 60 

CCAGTTTTTA TCCTATCTAC GCTATGGTTA AGGAAGTATC TGGTGACTTG AATGATGTTC 120 

GGATGATTCA GTCAAGTAGT GGTATTCACT CCTTTGAACC TTCGGCAAAT GATATCGCAG 180 

CCATCTATGA TGCAGATGTC TTTGTTTACC ATTCTCATAC ACTCGAATCT TGGGCAGGAA 240 

GTCTGGATCC AAATCTAAAA AAATCCAAAG TGAAGGTCTT AGAGGCTTCT GAGGGAATGA 300 

CCTTGGAACG TGTCCCTGGA CTAGAGGATG TGGAAGCAGG GGATGGAGTT GATGAAAAAA 360

CGCTCTATGA CCCTCACACA TGGCTAGATC CTGAAAAAGC TGGAGAAGAA GCCCAAATTA 420 

TCGCTGATAA ACTTTCAGAG GTGGATAGTG AGCATAAAGA GACTTATCAA AAAAATGCGC 480 

AAGCCTTTAT CAAAAAAGCT CAGGAATTGA CTAAGAAATT CCAACCAAAA TTTGAAAAAG 540 

CGACTCAGAA AACATTTGTA ACACAACATA CAGCCTTTTC TTATCTAGCG AAGAGATTTG 600 

GGCTTAATCA ACTTGGTATT GCAGGTATCT CTCCTGAACA AGAACCAAGT CCACGACAAC 660 

TAACAGAAAT TCAGGAATTT GTTAAGACCT ATAAGGTTAA AACGATTTTT ACAGAAAGTA 720 

ACGCTTCTTC AAAAGTAGCT GAAACTCTTG TCAAATCAAC AGGTGTGGGT CTTAAAACTC 780 

TGAATCCTTT AGAGTCAGAC CCACAAAATG ACAAGACCTA TTTAGAAAAT CTTGAAGAAA 840 

ATATGAGTAT TCTAGCAGAA GAATTAAAGT GAGGAAAGAA TGAAAATTAA TAAAAAATAT 900 

CTAGCAGGTT CAGTGGCAGT CCTTGCCCTA AGTGTTTGTT CCTATGAGCT TGGACGTTAC 960 

CAAGCTGGTC AGGATAAGAA AGAGTCTAAT CGAGTTGCTT ATATAGATGG TGATCAGGCT 1020 

GGTCAAAAGG CAGAAAACTT GACACCAGAT GAAGTCAGTA AGAGGGAGGG GATCAACGCC 1080 

GAACAAATTG TTATCAAGAT TACGGATCAA GGTTATGTGA CCTCTCATGG AGACCATTAT 1140 
CATTACTATA ATGGCAAGGT TCCTTATGAT GCCATCATCA GTGAAGAGCT CCTCATGAAA 1200

GATCCGAATT ATCAGTTGAA GGATTCAGAC ATTGTCAATG AAATCAAGGG TGGTTATGTC 1260

ATTAAGGTAA ACGGTAAATA CTATGTTTAC CTTAAGGATG CAGCTCATGC GGATAATATT 1320

CGGACAAAAG AAGAGATTAA ACGTCAGAAG CAGGAACGCA GTCATAATCA TAACTCAAGA 1380

GCAGATAATG CTGTTGCTGC AGCCAGAGCC CAAGGACGTT ATACAACGGA TGATGGGTAT 1440

ATCTTCAATG CATCTGATAT CATTGAGGAC ACGGGTGATG CTTATATCGT TCCTCACGGC 1500

tSACCATTACC ATTACATTCC TAAGAATGAG TTATCAGCTA GCGAGTTAGC TGCTGCAGAA 1560

GCCTATTGGA ATGGGAAGCA GGGATCTCGT CCTTCTTCAA GTTCTAGTTA TAATGCAAAT 1620

CCAGCTCAAC CAAGATTGTC AGAGAACCAC AATCTGACTG TCACTCCAAC TTATCATCAA 1680

AATCAAGGGG AAAACATTTC AAGCCTTTTA CGTGAATTGT ATGCTAAACC CTTATCAGAA 1740

CGCCATGTGG AATCTGATGG CCTTATTTTC GACCCAGCGC AAATCACAAG TCGAACCGCC 1800

AGAGGTGTAG CTGTCCCTCA TGGTAACCAT TACCACTTTA TCCCTTATGA ACAAATGTCT 1860

GAATTGGAAA AACGAATTGC TCGTATTATT CCCCTTCGTT ATCGTTCAAA CCATTGGGTA 1920

CCAGATTCAA GACCAGAAGA ACCAAGTCCA CAACCGACTC CAGAACCTAG TCCAAGTCCG 1980

CAACCAGCTC CAAGCAATCC AATTGATGAG AAATTGGTCA AAGAAGCTGT TCGAAAAGTA 2040

GGCGATGGTT ATGTCTTTGA GGAGAATGGA GTTTCTCGTT ATATCCCAGC CAAGGATCTT 2100

TCAGCAGAAA CAGCAGCAGG CATTGATAGC AAACTGGCCA AGCAGGAAAG TTTATCTCAT 2160

AAGCTAGGAA CTAAGAAAAC TGACCTCCCA TCTAGTGATC GAGAATTTTA CAATAAGGCT 2220

TATGACTTAC TAGCAAGAAT TCACCAAGAT TTACTTGATA ATAAAGGTCG ACAAGTTGAT 2280

TTTGAGGCTT TGGATAACCT GTTGGAACGA CTCAAGGATG TCTCAAGTGA TAAAGTCAAG 2340

TTAGTGGAAG ATATTCTTG 2359
(2) INFORMATION FOR SEQ ID NO: 244:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1052 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 244:

TTCTTTCTGC TATAATCGTA TAAAATACTT ACTTTAGGAG TTCTTATGAA AGTTGTTAAA 60
TTTGGAGGTA GTTCTCTTGC CTCTGCTAGT CAATTAGAAA AAGTTTTAAA CATCGTCAAA 120
AGCGATTCAG AGCGTCGTTT TGTAGTCGTT TCTGCGCCTG GTAAACGCAA TGCTGAAGAT 180
ACTAAGGTTA CGGATGCCCT GATTAAATAC TACCGCGACT ATGTTGCGGG TAACGATATT 240
AGCAAGAACC AAAGCTGGAT TATCGACCGC TATGCTGCTA TGGTTAGTGA ATTGGGACTA 300
AAACCAGCTG TGCTAGAAAA AATTTCTAAA AGCATTCACG CCTTGGCCAC TCTTCCTATT 360
GAAGAAAATG AATTTCTCTA CGATACTTTC CTAGCAGCCG GTGAAAATAA CAATGCCAAA 420
TTGATTGCTG CCTACTTTAA CCAAAATGGT ATCGATGCAC GCTATATGCA CCCTAGAGAA 480
GCTGGGATTG TGGTCACAAG TGAACCTGGT CACGCTCGCA TCATTCCATC AAGTTATGAC 540
AAGATTGAAG AATTGACAAA CACCAATGAA GTCCTTGTCA TTCCTGGTTT CTTTGGTGTC 600
ACTAAGGAAA ATCAAATCTG TACTTTCTCA CGTGGAGGTT CTGATATTAC AGGTTCTATC 660
ATTGCTGCTG GTGTCAAAGC TGACCTCTAT GAAAACTTTA CGGACGTTGA TGGTATCTTT 720
GCAGCCCACC CTGGTATTAT CCACCAACCA CACTCGATTC CTGAGTTGAC CTACCGTGAA 780
ATGCGCGAGT TGGCCTATGC AGGCTTCTCA GTCCTTCATG ACGAGGCTCT TCTTCCTGCC 840
TACCGTGGAA AAATTCCTCT GGTTATCAAG AATACCAACA ACCCTGACCA TCCAGGTACT 900
CGTATCGTTC TAAAACACAG TAATGATGAA TTTCCAGTTG TGGGAATTGC TGGTGACTCA 960
GGCTTTGTCA GCATTAACAT GTCGAAATAC CTCATGAACC GTGAGGTTGG ATTTGGCCGC 1020
AAGGTTCTGC AAATCCTGGA AGAACTTAAC AT 1052
(2) INFORMATION FOR SEQ ID NO: 245:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 855 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 245:

CCCTCGAAAA CTAAGCCGAT GAAGTCAGAA CACTTCAATC CTGTTCGTGA CTGGTGGGAA 60

AATCGTGAAG AGATTCTGGA AGGTAAGTTC TACAAATCTA AATCATTTAC ACCTAGTGAA 120

TTGGCTGAGT TGAATTATAA TTTAGACCAG TGTGACTTTC CAAAAGAGGA AGAGGAAATC 180

TTAAATCCCT TTGAGTTGAT TCAGAATTAT CAAGCGGAAA GAGCAACTTT AAATCATAAG 240

ATTGATAATG TATTAGCTGA TATTTTGCAG TTGTTGGAGG ACAAATAATG ACACCAGAAC 300

AACTTAAAGC AAGTATTCTC CAAAGAGCGA TGGAAGGGAA ATTAGTGCCG CAAAATCCCA 360

ATGACGAACC TGCAAGTGAA TTATTAAAGA GAATTAAAGC TGAAAAAGAA AAACTTATCA 420

GTGAAGGAAA AATCAAACGA GATAAAAAGG AAACTGAGAT ATTTCGTGGT GATGATGGGA 480

AACATTATGG GAAGTTTGCT GATGGAAGCA CTCAAGAAAT TGATGTTCCT TATGATATTC 540

CTGATACTTG GGAGTGGGTG AGGATAAAAT CAATTTATTG GAATTTTGGG CAAAATAAGC 600

CAGAGAAATC CTTTAGGTAT ATAGATACGT CTAGTATTGA TAGAAAAAAG AACATAATCA 660

ACTACAAAAA TCTACAATAT CTTTCACCTG AACAAGCGCC TTCCCGTGCT AGAAAATTAG 720

TTTCGCAGAA TAGTGTCTTA TTTTCAACAG TTAGACCATA TCTAAAAAAT ATTGCTGTAG 780

TTAGAGAACT TAAAGAGTAT TTGATAGCTA GTACAGCATT TAATGTTTTG GGATACTTTA 840

CTTAACGAAA CATAT 855

(2) INFORMATION FOR SEQ ID NO: 246:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 660 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 246:
TTTAGGAAGG CTATCCGTAA TTTTACAAAG GATTTAGATA TTACAGAGGA ACATTTAGAT 60
ATTATCAAAA GAGAGATGTT TGGCGAATTT TTCAGTAGCA TGAACTCTCT TGAATTTATT 120
GCAACGCAAT ATGATGCTTT TGAAAATGGT GAGATAATTT TTGATTTGCC GAAAATTTTA 180
CAGGAAATTA CTTTAGAGGA TGTCCTTGAT GCTGGACATC ATTTAATAGA TGATGGTGAC 240
ATAGTTGATT TTACAATATT CCCATCGTAG TAACCTATTA TAATAGACAC TAGAAAGAAG 300
GGATGACAAG TATGAGAAAA AAAACAATTG GAGAGGTTTT ACGATTAGCT AGAATCAATC 360
AGGGATTGAG TTTAGATGAA TTGCAGAAAA AGACAGAAAT CCAGTTAGAT ATGTTGGAAG 420
CAATGGAAGC AGACGATTTC GATCAACTTC CAAGTCCTTT TTACACGCGT TCTTTCTTGA 480
AAAAATATGC ATGGGCTGTT GAGTTAGATG ACCAAATTGT TTTGGATGCT TATGATTCTG 540
GGAGTATGAT TACTTATGAG GAAGTAGATG TTGATGAAGA TGAGTTGACA GGTCGTAGAC 600
GTTCAAGTAA GAAAAAGAAG AAAAAAACAT CATTTTTACC TTTATTTTAT TTTATCCTGG 660
(2) INFORMATION FOR SEQ ID NO: 247: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1805 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 247: 
CCGGTTGCAC AGGATCGTGC ATAGTCAACT CTTCAAGTAT AGCATATCTC CTATTTTCTT 60
ACAAGTAATA ACACCTAAAA TGAAGCTTTT TCTTTTACTT TTTTCTGCCA AGAGGCAAAA 120
AGCATGCTGA GGTAAAAAAC GCTCATCATA ATAGGAACAC CAAGAATGGT CTTTTCATGA 180
TAGAAAATCG TCAAATAGGC TGAAAAGACA ACGCCAAGGA CAAAACTACT AAGCAGGCTA 240
ACAAATATGA ATCCTTCACG CAAAAAAGGA GTGTGCTTGG TTCGGAAATA ATCTCCAAAA 300
GCCAGCATGG TCCGTTTGAT ATTCCCTGTC ATAAAAGCGT TATTATAGGC AATACCCGAC 360
ACTTCTCCAA AAGCAGTTGT CACCAGTCCC ATACAGAAGG CCAAGGGCGG CACTAGATAG 420
ATATTATCCA CAGTTTGCGG CACAAAAGCA ATAATGATTG ATAAGATTGC CAAGGGAATC 480
AAGGACAGAA TAGGTTTTTT CACAATTCTC AATTTTTCCT TATAAATCGT TAATAAAAAG 540
ACTCCCATCA TAAACGCTAG CAAGGTGAGA ACCTTGTCCC TAACATCCGA AACATTATTT 600
TTAATTAATT CTACTGAAAG AAAGACAACA TTTCCAGTTT GTCCAGCTAC AAGGGTATTC 660
CCGCGAACAA TAAAAGTGTA AGCATCCACA TATCCAGCAC AAAACGTCAA AAAAAGTGCT 720
AACCTTTTAG ACTGACGTGA TATTTTTCTT ATAGGTAATA ACCTCATTTT ACCTCCCATT 780
GTATTTTCTC TTAGAAATAT TGTACCATTT TCTTTCTAAA AAATCGTAGG CTACCATTTA 840
GATTTTACTA TTAGCATAAA AATAATAATA GACAACTATT TATCCAAAAA TAGATAGATG 900
TAACATGTTT GCAAACAAAG CATACGAACC TTTAGTAAAA TCATTTCCAT GAAACTAGAA 960
TAGAGCCCTC TTAGCAAAAA TCATTATTTT AATTTATTTC TAATCACTCC TTGACATAAA 1020
TAACTCTCAC CAATAAAAGA CTATGTCTTA AAAAAATGGT ATAATAAAAT CAATACTTGG 1080
GCTTGATGGC TATGCTACTA ATAACAATTA GGAGAGAAAA TCAGGCACTT GTTAACAACA 1140
AGGATTATCC CCTTGAGATG AAAGGAACTT TAGAAATCTT ATGATGAACA TGCAAAACAT 1200
GATGCGTCAA GCACAAAAAC TTCAAAAACA AATGGAACAA AGCCAAGCTG AACTTGCTGC 1260
TATGCAATTT GTTGGCAAAT CTGCTCAAGA TCTTGTCCAA GCGACCTTAA CTGGCGATAA 1320
GAAAGTTGTC AGCATTGATT TCAATCCAGC TGTCGTTGAC CCAGAGGACC TTGAGACTCT 1380
TTCTGATATG ACCGTTCAAG CCATCAACTC TGCTCTTGAA CAAATCGATG AAACTACCAA 1440
GAAAAAACTG GGTGCTTTCG CTGGGAAATT ACCTTTCTAA AAACAAGGAG CTAGAACAAT 1500
GCTTGTCGAT AACAAAGGCT AAGAAAGGTG CAAAAATGAC TCTATAATAT TTGTAGTGGG 1560
TAAATCCCCT ATGGATATTA TGGAGCCTAT TTTTGTGTAG AAAAAAGTCC CATATGACCT 1620
ATAATGAAAA GCGACAAAAC AACTCATTAG AAAGAATCAT ATGGAACAAT TACATTTTAT 1680
CACAAAATTA CTAGACATTA AAGACCCTAA TATCCAGATT TTAGACATCG TCAATAAGGA 1740
TACACACAAG GwAATCATCG CCAAACTGGr CTATGAAGCT CCATCTTGTC CTGAGTGCGG 1800
AAGTC 1805 
(2) INFORMATION FOR SEQ ID NO: 248: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2516 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 248: 

CTGCATCTAG TTTGTTTCTC CCTACAGTTT TAGCTAGACA GATTGGAGAT TATGATTTAA 60 

CGTCGCCGCG TTGGGGTTCG GATACAACTA GTGAGCTTGA GAAAGAAAAC TCCTCTGCTG 120 

GAATTAATAA TAATGACAGC ACTGGTGGCG GTAAAAGGTT AAATACCTCT ATTCGTAGCG 180 

CCTATAGTGG GTCAGATATT ACCCCGGTAT ATTCATTGGG GTCTGGCTCT AGGATTGTCA 240 

TGTACTATAA TGGAGGTGGT GACAATTATA TTGGTTCTGG TACTAGATTA GCTATGGCGC 300 

CACAATTTGG AAATCATGTA AGAATTCATA CTTCAGGTTC TTGGAATCCA GATTCTTATT 360

AACTTACTTG TCAGAGTAAG CCTTAAAGAT GGTTGATTGT GGGTGTAGCA TGAAAAAAGA 420 

ATGCTACACC CTATTTTTAT TATAAGGAGG AGTAAGGATG GAATTTTTCA TTTGTAATCT 480 

TGTACGAGTC GTTCAATCAC CTCGATTTTA TATGTCTTTA TTTTTGACCC TTCTTTGCAT 540 

GAGTTTAGGA AATTTCCTTG CTTTCAATGG TATTTATAAA ATTGAAGGTT TATCGATTTT 600 

TTTTGCCGCT TCTTCTATTC GAGGATTTTC ACCGATTAGC CTAGTAGCTG CACTTATCTG 660 

TACACTGCCC TATTCTAGTC AGATAATAGA GGATGCTGAG AGTCATTTTC TAACAGCACA 720 

ATTGTGTCGA ATTTCTAAAA AGAAGTATCT GGCTATTGTG GGTAGTACTG TAATTATTTC 780 

TTCTTTTCTA GTCTTTTTTC TCCCCTATTT ATTATTATTA GGAATTAATC TTTTAGTGAC 840 

TCCTTATCAG GAAATTTATA TTGGAGATTA TAGTGGTGCC TTAAAAGAAT TATTTGATTC 900 

CAATCAGTTT CTCTATAGTC TTGTAACGAC TCTCTGGTAT GGAGTTTGGG GCGCTGTGTT 960 

CTCTATTTTT GGACTAGCTA GTGCTTTGCT AGTGAAGAAA AAAATAGGAG CTATTTTCAT 1020 

CCCAGTTGCC TATATGATGG TTGGTGGTAT TTTTTGGGCT ATTTTAGGGC TATCTTACTT 1080 

AGAACCTGTG ACAACGCTAG CTTTGGGATA TCAGAAAGAT ATCAGTCTTT CCTTAGTTAG 1140 

TGCTCATCTT GCTTTTATTT TATTTGTTAG TTGTTTGGTT GTTTATGGTA CATTTTTTCT 1200 

ACATTCAGAG GACTATGTAT AATGAAACAA TTTGTTCAAT TTTATAAAAA AGATTTCTTA 1260 

GCAGTATTGG TTTATTTTAT ATTACTGCTA TCCTGTGTTT TATCTAGTAC AGTATATTTA 1320 
TTGCGCtGTC GCCAATATTC AATCCATCCA AATGTATTAG AATGGATCTT AGTTTTACTT 1380

CAAGATATGA CGACTGGAGT ATATTGCTTT CGGTTCACAT ATATATTGTT CTTTTTTTAT 1440 

TTGATGAATA ACTATTTTAA TAGGTTGGAG TGTCGCATTC GTCTGAAATC AATTAAGCAC 1500 

TTTACCAGTT TTAGTTTCAA ATTAGCAGCT CTTAGTACGG GGATTTGGAC GGCGACTTTA 1560 

TTTTTATTGA TTTTTCTAAT TGCATTTAGT AATGGTTTTA GCTTCTCTTT GGAGATAAAG 1620 

GAGGTTGATT TTTTAAGAGA ATTTTATGGT ATAAGTATTG CAAACAATGC TAGTTTCTTT 1680 

ATAGGATTTT TTTTCTCTTA TATAGCATAC TATTTCTTTT TATCCTTACT TACTATTAGC 1740 

AGTTTTTCTT GGTTTAAAAA ATCAAACATG AGCTTAGTAT TTCTGTTTAC TTTTTTATTT 1800 

GTAGAATCCT TATTCTGGAT TTATCAGTTG GACAATGGGA TAATTGGATT ATTGCCAATT 1860 

TTTCAGTATA TGGTAAATTC CAATCCGTAT GCATTGATTT ATTGGCTTAC ATTACTATCT 1920 

ATCATAATTC CATTGACTGT ATTTTCTGTT CATAGAAACT GGAGGAGAGT GTAAAAGTTG 1980 

GAAATGGGAA AGTTAAGTAG TCACATGTGG AGGTTGAATC AGATAATCTA TACCAAGTAC 2040 

TTTTGGGGTT ATGTTCTTTT TTGGATATTG ATTTGTTTAG GATTATGGTA TTGGTTAGAA 2100 

GGAAATGATA GACTTGTTAT AGAAATTTTA AAAGGGCCTA ATCTGAGTCA AAACTCTTTT 2160 

TTAGTCTTAT CTATATGGTT GCTTCATTGG TTTATTATTC ATACATTTTT TCTAGCAGTT 2220 

GTATATCGTA GAAGAGCATC CGATTTCTTT ATGGAAGTGA TTCGATTTTC TTCTATTAAG 2280 

CTCTGGATTA GGTATCAGAT TTGGACCTGT TTTCTTTATG GACTCATTTT AATCATGGTA 2340 

AAAGTTCTAG TGATTCAATT TATGTTACAG TTACCAAACT GGGATATAGG AGTTTTGTTT 2400 

ATAGTTGATT CTTTGAATGC TTGTGTGTTA GTCTTGTTTT GCTTTATGTT ATACGCACTA 2460 

GGAGCGAATG TACAAATGAA CTTTGCTTGC GTTAGTTTCT TTTTACTCAT GATTGG 2516 
(2) INFORMATION FOR SEQ ID NO: 249: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1364 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 249: 

CGGTGTTTTT TTGTAAATTT TCTAGCACTT GTATGGTAAA ATAGATACAG GTGTTCATTA 60 

AACTAGACTA AAAACCTATT TAAGCAGGCA AAATGAAGAA ATACCAACAA TTATTTAAGC 120 

AAATCCAAGA AACCATTCAA AACGAGACTT ACGCTGTCGG AGATTTCCTT CCTAGCGAGC 180 
ACGACCTTAT GGAGCAATAT CAAGTGAGTC GTGATACCGT CCGAAAGcCC TGTCTCTCCT 240

CCAAGAGGAA GGATTGATCA AAAAGATAAG AGGGCAAGGT TCTCAAGTCG TCAAAGAAGA 300 

AACCGTCAAT TTCCCTGTAT CCAACCTAAC CAGCTACCAA GAACTAGTTA AAGAACTTGG 360 

ACTGCGCTCT AAAACCAACG TGGTCAGTCT GGACAAGATT ATTATTGATA AAAAATCCTC 420 

ACTGATAACC GGTTTCCCAG AGTTTCGGAT GGTTTGGAAG GTGGTCCGCC AGCGTGTGGT 480 

GGATGATCTG GTATCCGTTC TGGATACGGA CTATCTGGAT ATGGAACTCA TCCCAAATCT 540 

CACTCGCCAA ATTGCTGAGC AGTCTATCTA TTCTTATATA GAAAATGGCC TCAAACTCCT 600 

TATTGATTAT GCTCAGAAGG AAATCACCAT TGACCACTCA AGCGACCGAG ACAAGATTCT 660 

CATGGACATT GGCAAAGACC CTTATGTCGT TTCGATTAAA TCAAAAGTCT ATCTCCAAGA 720 

CGGACGCCAA TTTCAGTTTA CCGAAAGTCG CCATAAGTTA GAGAAATTTA GATTTGTAGA 780 

TTTTGCAAAA CGCAAGAAAT AAAAGACTGA GACACCAGAT CTCAGCCTTT TTCGGCTCTA 840 

TAATATTTGT AGTGGGTAAC CCCCCTATGG ATATTATGGA GCCTATTTTG TGTAGAAAAA 900 

AAGTCCCATA TGACCTATAA TGAAAAGCGA CAAAACAACT CATTAGAAAG ATTCATATGG 960 

AACAATTACA TTTTATCACA AAACTGCTCG ATATTAAAGA CCCAAACATC AAGATTCTAG 1020 

ACATCATCAA TATGGATACC CACAAAGAAA TTATCGCTAA GCTGGATTAT GAGGCTCCAT 1080 

CTTGCCCTGA TTGTGGAAGT CTAATGAAGA AATATGACTT TCAAAAACCG TCTAAGATCC 1140 

CTTACCTCGA AACAACTGGT ATGCCTACTA GAATTCTCCT TAGAAAGCGT CGTTTCAAGT 1200 

GCTATCATTG TTCTAAAATG ATGGTCGCTG AAACTTCTAT CGTCAAGAAG AATCATCAAA 1260 

TTCCTCGTAT TATCAACCAA AAAATTGCGC AAAAGTTGAT TGAGAAGATT TCTATGACCG 1320 

ATATTGCTCA TCAGCTGGCC ATTTCAACTT CAACTGTCAT TCGG 1364 



(2) INFORMATION FOR SEQ ID NO: 250: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1227 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 250: 

CCATGAAGAC CGCTTGGAAT TGGAATGGCA CAAGTCTTTG TTGAATGGTC TATTCCCATT 60 

GACAATCGGT GGAGGAATTG GACAATCTCG TATGGCCATG TTCCTACTTC GCAAGAGACA 120 

CATCGGAGAA GTGCAAACAA GTGTTTGGCC TCAAGAAGTC CGCGATACTT ACGAAAATAT 180 

TTTGTAGAGA ATCGAACCGC AAGGTTCGGT TTTCTTTCTC TTTTTGTCTA TAATTTGGTA 240 
TAATAAACAG TATGAAAATC GTATCAGGAA TCTATGGGGG ACGTCCCCTC AAGACACTAG 300

AAGGCAAGAC GACAAGACCT ACTTCGGATA AGGTTAGGGG AGCCATTTTT AACATGATTG 360 

GTCCCTACTT TGAAGTGGGA CGAGTCTTGG ACCTTTATGC AGGTAGTGGT GGTTTATCTA 420 

TCGAAGCAGT ATCGCGTGGC ATGTCCAGTG CTGTTTTGGT GGAGCGAGAC CGTAAGcTCA 480 

GACCATCGTG GCTGAAAATA TCCAGATGAC CAAGGAAGTT GGAAAATTTC AACTCCTCAA 540 

GATGGATGCA GAAAGGGCAT TGGAACAGGT ATCTGGGGAA TTTGACCTCG TTTTCTTAGA 600 

CCCTCCCTAT GCCAAGGAAC AAATCGTAGC AGATATTGAA AAAATGGCTG AGAGAGAGCT 660 

TTTTTCTGAA GATGTTATGG TTGTGTGCGA GACGGATAAA GCCGTTGAAC TTCCAGAAGA 720 

AATTGCCTGT CTGGGTATCT GGAAGGAAAA GATTTATGGA ATTAGTAAGG TGACAGTCTA 780 

TGTCAGATAA GATTGGCTTA TTCACAGGCT CATTTGATCC GATGACAAAT GGGCATCTGG 840 

ATATCATTGA ACGGGCGAGC AGACTTTTTG ATAAGCTTTA TGTGGGTATT TTTTTTAATC 900 

CCCACAAACA AGGATTTCTC CCTCTTGAAA ATCGTAAACG GGGGTTAGAA AAGGCTGTGA 960 

AACATTTGGG AAATGTTAAA GTCGTGTCTT CTCATGATAA ATTGGTGGTC GATGTCGCAA 1020 

AAAGACTGGG GGCTACTTGC CTAGTGCGAG GTTTGAGAAA TGCGTCGGAT TTGCAATATG 1080 

AAGCCAGTTT TGATTACTAC AATCATCAGC TGTCTTCTGA TATAGAGACT ATTTATTTAC 1140 

ATAGTCGACC TGAACATCTC TATATCAGTT CATCAGGCGT TAGAGAGCTT TTGAAGTTTG 1200 

GTCAGGATAT TGCCTGCTAT GTTCCCG 1227 



(2) INFORMATION FOR SEQ ID NO: 251: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 3652 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 251: 

CCGGTCAAGT TAAAAACGCT ATTTCTTCCC ATTTTATTTA TTTTTTAGGA GTGGTAACGT 60 

ATCAAAATAG CCCAAGCGTT CTCACCCGTG TGAGTTTGAA TAATGGAACC CGTTTCCAAA 120 

ACAGAAATTG GCTTTTCAAC ATAAGCTTGT AAGCTTTCTT TCATCTCTTT TGCCCAATCA 180 

TCACTACCAG AATATGAAAT TCCAATCTCT GCTACAGCAC GTTCAGAAAG CGATGTTATC 240 

AACTCATCTA ACCATTTTTT AAATGTTTTA GTTCCACGAC CTTTAACCAT TGGCTGCAAT 300 

TCATGGTCTT TCATTTGCAT GACAGCACGG ATATTGAGAA GAGAGCTCAA CAAGCCAGTT 360 
ACACGGCTAA TTCGTCCACC TTTGACAAGA TTTTCCAAAG TTGAAACACC AATATAAAGC 420

TCTGTATGGT TTTTAACCTC TTCTACATGA GATAAAATTG CCTCCATATC TTTACCTTCT 480 

TGAGCTAACT TCGCAGCCTC AACAACTTGG AATTTCAGGG CTTGGTCAGT GAAGGAACTA 540 

TCAACAACAG TCACATCTGC AGTAGATAGG CTAGCACCTT GGCGTGCTGC TTCTACCGTA 600 

CCCGAAAGAG CATGGGACAT ATGAATAGCA AGAATCTGGC CACCATCTTT GCATAGGTCT 660 

TCAAAAATCT CAGCAAAGAC ACCTACAGGT GGCTGACTTG TTTTCGGAAG ATTCTTACTT 720 

TCTTGCATCA ACTGAAGAAA TTTACCTTCT TCTTTCAAAT CCGCATCAGA ATAAACAACA 780 

TTATCAATCA TTACAGATAA TGGAACAATT GTAATATCTA ATTGCTTTAC TAGTTCAGGT 840 

TCAATAGTAA CAGATGAATC GGTTACAATC TTAATTTTTG TCATAGTATC AATCTTTCTA 900 

TTTTAGGATT CAGATTGGTT TCCTTACTTC TAATTATATC AAAAAAAAGA TTAAAAATCC 960 

TAATGGAGTC AATCAAATTT TCCGTAAAAT TTGATATAAT CAACTTATAA GAAAAGAGGT 1020 

GTCCTATGAT TAAAAAAATT TACCCCATTT TTACCATTTT ACTAGGTGCT GCTATTTATG 1080 

CTTTTGGACT GACTTATTTT GTAGTTCCCC ATCATCTCTT TGAAGGAGGG GCGACAGGCA 1140 

TTACCCTCAT CACCTTTTAT CTTTTTAAAA TCCCTGTTTC CCTCATGAAC CTGCTGATTA 1200 

ATATTCCCCT TTTCATCCTA GCTTGGAAGA TTTTTGGAGC CAAATCCCTC TATTCTAGTT 1260 

TACTAGGAAC CTTAGCTTTG TCCGGCTGGT TAGCTTTTTT TGAGCATATT CCCCTTCATA 1320 

TTGATCTTCA AGGTGATTTA CTAATCACAG CCCTTATAGC GGGAATCCTA TTGGGAATTG 1380 

GCCTTGGAAT TATTTTTAAT GCTGGAGGTA CAACTGGCGG AACTGATATT CTAGCTCGTA 1440 

TTCTCAACAA ATACACTCAT ATATCCATAG GAAAACTGCT CTTTATCTTA GATTTTTGTA 1500 

TTCTCATGTT GATTCTCCTA ATCTTCAAGG ATTTGAGATT GGTTTCCTAC ACGCTTTTGT 1560 

TTGATTTTAT TGTTTCTCGT GTTATTGATT TGATTGGTGA AGGAGGATAT GCCGGCAAAG 1620 

GCTTTATGAT TATCACAAAA CGTCCTGACC AACTTGCTAA GGCGATTAAT GATGACCTCG 1680 

GAAGAGGTGT TACTTTTATT TCTGGTCAAG GCTACTATAG TAAAGAAAAT TTGAAAATCA 1740 

TCTACTGTAT TGTCGGAAGA AATGAAATTG TGAAAACGAA GGAAATGATT CATCGAATCG 1800 

ATCCTCAAGC CTTTATAACT ATTACAGAAG CCCATGAAAT CCTAGGAGAA GGCTTCACCT 1860 

TTGAAAAAGA ATAAAAAGAG GTAATGTCGT GACCTCAAAA GTTAGACTAA ATCATCTATC 1920 

TTTTGGGTTA CAGACAACCT CTTTTTTATT TTATTTACTC AAGCTCTTAA GACCAATTCC 1980 

GAGTTACTTC TTCATCAGCC TTTAACTGAT CCACTAATTG GTCAACTGAG TCAAATTTGG 2040 

TCATATCTCG AATGCGATCA AGCCAATAAA CCATGACGGT TTCCCCATAA ATATCTTGAT 2100 

TAAAATCAAA AATATTGACT TCAAAACGTG CTTCTTCTCC ATCAAAGGTC ACATTTTTCC 2160

CGACACTAGC CATAGCACGA TACTTCTGTC TTTGAATCTC AACATCAACA ACATAAACGC 2220

CATCTGCTGG CATATAAGTA CGGTCTAAAA GCACTAAATT CGCTGTCGGA TAACCAATTG 2280 

TACGACCACG AGCATTACCA TGAACCACCA TACCTCTTGA TGGAAGCGGT GCCCCCAAAA 2340 

GTTTTCCTGC TTCTTTCACA TTTCCATCTA AAATAGCTTG ACGGATACGA GTTGAACTAA 2400 

TCTTTCCTTT CTCATCTTCT ACAGGTGGAA CAATGATAAC TTCTCCATCA AAGTAATTCT 2460 

TTAAATCTTC TGCTGTTTTT TTGTCAGAAC CAAATGTATA ATCAAAACCT GCAACAATAA 2520 

TTTTGGCATT CATAGCCTTG ATATAAGTTG CAAAGAATTC TTGTGCAGTG AGACTAGCGA 2580 

ATTGACTACT AAAATCAAGG AGATATAATT CTTCTACACC TTCGCGCTTT AATTTTCTTT 2640 

CACGTTCAGC AGGGTTCAAA ATATGCAAAA ACAAATCTGG ATGATAAGGC TCTAAAGCGA 2700 

TCTTTGGAGA TTCATTAAAG GTCATAACGA CGATAGGCAA CAAATCCTTT CTCGCAGCCT 2760 

TGTTGGCAAC ACGAAATAAT TCTTGATGCC CCTTATGTAT GCCATCAAAA TAGCCGAGAA 2820 

CAACGACTGA ATCAGATGGT GTGCCAATAT CTTTTTGGTT TTTTATAGGA ATAGTAATAA 2880 

TCATAAAATA ATTATATCAT AGCGATAGCT ATTTCTGGAA CAGAAAATCT GAAATGTTGT 2940 

TTTTTTCACA TGAAGTGTAC CTGTTTTCAA AAAGCACTTT ATTCTATCGT TGCTTAACTA 3000 

TGAACTTTGC AATATTCTTC TCAAAAACTT GTAGGACATC TTCAAAATTT TGCAAGGAGT 3060 

GATTAGACTT GTTCGGTAAC CATAAAGTGT CATACTATGC TTATGTATGA AAAAGCAATG 3120 

CAACTAACTC CTGAGAACTT TAAATTACTA ATTGGTGCCG AAAAGGTAGA ATTTAGAATC 3180 

GAGGTACACC TATGGCTGTA AAATTTACAA AATGAGACAA CTTGGGCAAG ATGTTTGAAG 3240 

AATTTCCTAA ACTCCCTGAT TTGAAGCAAG TCACTTTCCC TAATGACAAA GAAAAAAGCC 3300 

AAAACAGCAA AGAAAAACTA GATGACTGCT TTCCAACAAC TCCCATCTAG TGTGCTTCAG 3360 

ACTGGGCTAT TTTTCTCTCC ATCTGTTAGC TTGGATTCTC AGACCGTTTC AGCTAAAGAA 3420 

TATCTTTTCC CTTATCAGAA GGAACGGCTC AAGCCATTCA GACAAGTGAA GGGACGACAA 3480 

GCCAATATTT GAAACCAGAT AGCAGTTCTT ATAGTCAATT GAAATAAAAT CTGAAGAAAT 3540 

CGAGTAGGAA ACTCATATCA ATGTTTAACA GTGTTCTATT CCAGATTCAT ACTCAATGAw 3600 

AATTAAAGTG CAAACTAGGA AGTTAGCCGC AGGTGATACT TTGGGTACGG CA 3652
(2) INFORMATION FOR SEQ ID NO: 252: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 743 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 252:
GTACCGTGGT GCCAAAGTAC AGCAAGGTTG GCTTTTTGAC AAACAATACC AATCTTGGTT 
TTACATCAAA GAAAATGGAA ACTATGCTGA TAAAGAATGG ATTTTCGAGA ATGGTCACTA 
TTATTATCTA AAATCCGGTG GCTACATGGC AGCCAATGAA TGGATTTGGG ATAAGGAATC 
TTGGTTTTAT CTCAAATTTG ATGGGAAAAT GGCTGAAAAA GAATGGGTCT ACGATTCTCA 
TAGTCAAGCT TGGTACTACT TCAAATCCGG TGGTTACATG ACAGCCAATG AATGGATTTG 
GGATAAGGAA TCTTGGTTTT ATCTCAAATC TGATGGGAAA ATAGCTGAAA AAGAATGGGT 
CTACGATTCT CATAGTCAAG CTTGGTACTA CTTCAAATCC GGTGGTTACA TGACAGCCAA 
TGAATGGATT TGGGATAAGG AATCTTGGTT TTACCTCAAA TCTGATGGGA AAATAGCTGA 
AAAAGAATGG GTCTACGATT CTCATAGTCA AGCTTGGTAC TACTTCAAAT CTGGTGGCTA 
CATGGCGAAA AATGAGACAG TAGATGGTTA TCAGCTTGGA AGCGATGGTA AATGGCTTGG 
AGGAAAAACT ACAAATGAAA ATGCTGCTTA CTATCAAGTA GTGCCTGTTA CAGCCAATGT 
TTATGATTCA GATGGTGAAA AGCTTTCCTA TATATCGCAA AGTAGTGTCG TATGGCTAGA 
TAAGGATAGA AAAAGTGATG ACA 
(2) INFORMATION FOR SEQ ID NO: 253: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 4010 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 253:
TTTTGGTTGA TGATACGAGG GATTTGGTGA TTCTTCTTGA CGATAGAAGT TTCAGCGACC 
ATCATTTTTG AACAGTGATA GCACTTGAAT CGACGCTTTC TAAGGAGAAT TCTAGTAGGC 
ATACCAGTCG TTTCAAGATA AGGAATTTTA GAAGGTTTTT GAAAGTCATA TTTCTTCAAT 
TGGTTTCCGC ACTCAGGGCA AGATGGGGCG TCGTAGTCCA GTTTGGCGAT GATTTCCTTG 
TGTGTATCCT TATTGATGAT GTCTAAAATC TGGATATTAG GGTCTTTAAT GTCTAGTAAT 
TTTGTGATAA AATGTAATTG TTCCATATGA TTCTTTCTAA TGAGTTGTTT TGTCGCTTTT 
CATTATAGGT CATATGGGAC TTTTTTTCTA CAATAAAATA GGCTCCATAA TATCTATAGT 
GGATTTACCC ACTACAAATA TTATAGAACC GAATTAATTT AATTAGAGAG CCAACTTTCT 
AATATAGTAA TCGCGTCATA ACAAGGTATC TATCATTCAT GGAGTTCCTC CTGTATACTA
TTAGTAAAGT AAAACTATTG GAGGATATTT TAATGCCACA ACCTATTGTT CCTGTAGAGA 600

TTCCACAATC TCGTCGTTTT GATTCTAAAA AGAGAAATGA TATTCTGCTT AAAATTCGTA 660 

TTGGCAAGCT TGAAGTAAGT TTTTTTCAAT CTCTCAATCT CGAAATGGTA GAACAGCTTT 720 

TGGATAAAGT GTTGCTCTAT GACAATTCAT CTATCTAGCC TAGGGCAGGT CTATCTCGTA 780 

TGTGGGAAAA CGGATATGAG GCAAGGCATT GATTCATTGG CTTATCTGGT TAAAACCCAC 840 

TTTGAATTAG ATCCTTTCTC CGGTCAAGTT TTTCTCTTTT GTGGTGGACG TAAAGACCGC 900 

TTTAAAGCCC TTTACTGGGA TGGTCAAGGA TTTTGGCTAC TATATAAACG CTTTGAGAAC 960 

GGAAAACTGA CTTGGCCCAG TACAGAAAAG GATGTCAAAG CTCTCACACC TGAACAAGTA 1020 

GATTGGCTTA TGAAGGGCTT TTCTATCACT CCAAAAATAA ATTTATCAGA AAGTCGTGAT 1080 

TTCTATTGAA ATGAGGACTT TCTTTTTAGT TATAATAAAG TTAGGAAATA AGGAGAGGAA 1140 

GCCCATGGAA GAAGATTGAA AATCATTCAA CAACAGAGTG CTACAATTGA TAGTCTCACC 1200 

AATGAACTTG CCCTTCTTCG TGAACAAGTG GCTTATCTAA CGCAAAAGCT CTATGGAAAA 1260 

TCCTCTGAGA AAAGTGTTTG CCCATCTGGA CAACTCAGTC TTTTTGAAGA GGAACAAAAT 1320

ATGGAAGAAG ACTCTGACTT ACCCAGTTGA AAGAGAAGAA ATCACCTATA AACGTAAGAA 1380 

AGCTAAAGGG AAACGTCAAG CTCTTCTTGC CCAATTTGAT TCAGAAGAAG TTCATCATCA 1440

AGTAGAAGAG AGCATTTGCC CTGATTGTCA GGGAGATCTA AAAGAGATTG GAGCAACCCT 1500 

TCAACGACAA GAATTAGTCT TTATTCCTGC GCAATTAAAA CGAATAGATC ATATCCAACA 1560 

CGCTTATAAG TGCCAAGCAT GCAGTGATAA AAATCCGAGT GATAAAATCG TGAAAGCTCC 1620 

TATTCCTAAA GCCCCTTTGG CGCATAGCCT TGGCTCAGCT TCTATTATCG CTCACACCAT 1680 

CCATCAGAAG TTTAATCTGA AGGTACCCAA TTATCGCCAA GAAGAAGATT GGGCTAAGAT 1740 

GGGTTTACCA ATCACACGTA AGGAAATTGC TAATTGGCAT ATCAAGGCGA GTCAATACTA 1800 

TTTCGAGCCC CTTTATAATC TTTTACGAGA AAAGTTGTTA GAACAAGCTC TTCTTCATGC 1860 

GGATGAAACC TCTTATCGGG TTCTAGAGAG TGATAGTCAG TTGCCTTACT ATTGGACTTT 1920 

TTTGTCTGGG AAAGCTGAGA ATCAAGCAAT CACGCTGTAC CACCATGATC AGCGTCGGAG 1980 

TGGTTTAGTA GTACAAGAAT TCCTAGGAGA TTATTCTGGC TATGTTCATT GTGACATGTT 2040 

GCGGCAGTAA CTTAGGACTT TAGTCCTCTA GTTCTGCCTA TGCGATAGCA GTCCAAGGTT 2100 

TAGGAGTAAG GCGACGCTAA GCTTGGTAAA CTGCGAACAG CTAGAAGCTT ATCGTCAACT 2160 

GGAAGAAGCT GCACTTGTTG GATGTTGGGC GCATGTGAGA AGGAAGTTTT TTGAAGTGCC 2220 

CCCCAAGCAA GCAGATAAAT CATCCTTAGG AGCTAAAGGT TTAGCTTATT GTGATCAGTT 2280

ATTTTCCTTG GAAAGAGACT GGGAGGCTTT GCCAGCTGAT GAACGACTAC AGAAACGTCA 2340

AGAACATCTC CAGCCCCTAA TGGAAGACTT CTTTGCTTGG TGCCGCCGTC AGTCAGTTTT 2400

AGCAGGTTCA AAACTAGGAA GGGCAATTGA ATACAGCCTC AAGTATGAAG AAACCTTTAA 2460

GACTATTTTG AAAGACGGAC ATCTGGTCCT TTCCAATAAT CTAGCTGAAC GCGCCATTAA 2520

ATCATTGGTT ATGGGACGGA GTAAAAGAGT CCAGTGGACT CTTTTAGCCT GAGCTCAGTT 2580

TAAAAAAGCG AGGGTGGTTA TTTTCTCAAA GTTTTGAAGG AGCTAAAGCA AGAGCTATTG 2640

TTATGAGCTT GTTGGAAACA GCTAAACGTC ATCAATTATA GTGCGTTGAA TCTATAACAG 2700

TACGCATCGA CTGCTAAAAC ATTTCTATAA ATCAATTTTC CTTTCCTAAT CGATTTGTTC 2760

ATATCTTATT TCAATCCATT ATAAATAGCG AGAAATATCT ATCCTATCTT CTAGAATGTC 2820

TTCCAAACGA GGAAACTCTC GTAAACAAAG AGGTTTTAGA GGCCTATTTA CCGTGGACTA 2880

AAGTTGTACA AGAAAAGTGC AAATAAGAAA TCTCCAGATT AGGAACTATC CGTGAGTTCT 2940

CTAGTCTGGA GATTTTTCAA TAGACTTCGT TATTGGACGG TTACAATTTA TTATATGAAA 3000

ATCCCATATT ATTCTCCAAT TCTATATTTT ACCTTTCTAA ATGTATAGAT TAACTACCTA 3060

ATTATAGCAT ATAACGCAGA TTCCTTTCAA TCGTATGATT TACTGCATTA AATTAAGTAA 3120

AAAAATAAAG GCAGTCCGAA GACTGCCGAT ATTTATCTCT CATCTCTTTA ATTATGGTAA 3180

GTAAATAAAT AATTTCCCTA AAGATATGGA AATTATTAAT ACTATAAATA CATATTATAA 3240

AGTTTATAAA TACTGTAAAA ATCCTGAAGT TAATTTTCTA ATAAATATCA ATATGTGTTA 3300

GTATCTTTTA AATTTTTAGA CAATTTACTA GTTCTATAGA CATGTTTAAC AGACTCTATT 3360

TTACAATTCA AAAATTTCAT CTGCCACTTC ATTTAAAAAT TCTATATCAT GGGAAACAAT 3420

AAAAATTATT TTATCCATGG TTTTATACTT ATTAATCAGT TCAGATATTT TTATCATATT 3480

GGAATAATCC ATACCACTTG AAGGTTCGTC AAAAAAGACA AATGGAGAAT TCTTGCACAT 3540

AACAGATGCT ATTGCAAGCC TTTGCTTTTG CCCTCCTGAT AAACTCATCG GATGCCTTTC 3600

AATAAATTCG TCCAGGCATA AATCTTTTAA CCCAAATCAT TCATACCTCT CTCAACTAGA 3660

TGTAACTTAC AAAACCCCTG ACCTCATGAG CCACTTTCTT CCTCCTCATG AGGTCAGTTT 3720

TACTTTCTGC TGTTCCAGTA TCGTTTTTCC TCGCTAGATT TCCTCAAAAG GGCAGACTCC 3780

TCCCTTGGTT CGTCACACGA TTTTTTCATC TCGACTGTTC TTTAATGCAT CATTAACGAC 3840

GCTTTTCTTC TAGGTGGTTC ATAAGGAACA GGAAGATTCA GGTTGACTTT TCTAATCCTA 3900

GAATAAAGTG CTGAAAACAA TTCGGAATAG GCATAGAGAC TAGACAATTT GAGGAGCTGC 3960

TTGCGTCCTG TTCGAACACA TTTTCCCACC ACGTGAAGAA AAAGATGGCG 4010
(2) INFORMATION FOR SEQ ID NO: 254:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 2789 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 254: 

ATGCATCCGT TTGTCAAGCC TAAATTGTAA TTTTTTTCAA TTTAAAACAG AAAAACCCAG 60 

GAAAATGACA TAAAAATATC ATTCCTAGGC CTATTTATGC TATTTCTCTC TGAAAAATAT 120 

GAGTATTCAG TCGGTCAAAT GAAGCTGAAC GAACTCATTT TCCCTCGCCT AATTCAATGA 180 

TTCGATGACA TTGTTGGGCT ACATAAGCAT CGTGGGTCAC GATAATGACT GTTTTCCCCT 240 

CTCGATTCAT CTCTAAGAGA AACTTCAAGA CCAAATCTCT ATTTTCAGGA TCCAGAGAAC 300 

CTGTCGGTTC ATCGGCTAAA ATCAGCTGGC TGGGTTTTAA GATGGCTCTA GCAACTGCAA 360 

TTCGTTGTTG TTCGCCCCCA GACAACTCGG AGACCCTTTG ATGCAAAGTA GCTGATAAAC 420 

CTACTCTCTC TAAAATCTCT TCCACCTTTT TGAGCTTGTC TTTCTTAGGC AATTTCACAT 480 

ATTTCAGCGC CACATGAGAT TGTACTCGAC CGTTTCATCA TCAATCAGGG CAAAATTTTG 540 

AAACAGATAA GAGATATGTT CACGGATTAT TGTTTGCGAC TTAGCAGAAT TAACCGCTAG 600 

ATTTGTCTGA CCAAAAATCT CATACCGTCC GCTATAATCA CCATCTATCA AACCCAATAA 660 

ATTTAACAAG GTCGACTTCC CACTACCACT CTTACCAACA ATAGCTACCA AATCCCCCTG 720 

ATCAATCCTG AGAGATAAGT TATCCAAAAT CACTTTTCCC CCAATGGTTT TGGTAATATT 780 

TTTCAACTCA ATCATAAGAT GCCCCCTTTC AATAACTCTA CTAGACTTCT TTTCTCCATC 840 

CTAGAAGCTA AGCCTAGCAC AAATAGTATA TCCAGACATG TAAAACCTGC AAACAGTAGA 900 

AGTGGTAAGA ACGCATGGGC AAAGAAAATC AAGACTAGAA GAGGGAAACT ATAGCCCAGC 960 

AAGAGCAGAA CGAGGAGAGG ACGGTAGCGA TCGACCAGTT TCCACCCCAT AAACTTCTTG 1020 

GTAATGATAT CCCTGCGCTT CAATAAGAAA GTTGTTACTA GTAAGAAGTA GGAAATCATC 1080 

ATGCTAAGGA GACCAAACAA AGCAAAGAGT AGGTTAAAAT TCCGAACAGC ATCTCGATAA 1140 

GAATCCACTT TCTCTTGTTG AATGGCTTGA ATAGATGAAA ATTTTAAATA ATTTCCATCT 1200 

GACAATTTCT CAACTAACTC TGTAATCTCT TTTTGATGTT GAACCGTATT TTCAATTTTA 1260 

ATCGGATTAT TTAAGCCAGT TGTTGACAGG GAGGCTTTCT CATCCCACAT CATATCAGAA 1320 

TCATTGACCA AGCTAATAAT TGGATTGGAG AGATTTTCCT TTCGCTTATC ACTATATGGG 1380 

AAAAATGACC AATCTCCTTC ATAATAGGCA ATCTCGACAT CCATCTCCTC TATCGTTCGT 1440

TTTTGCTGCT CTTCATACTT CATCGAATGA AAGGCAATTA ACTTCCCCAA GAGCTGATTT 1500

TTATCTTCTT CACCTTTCGT ACTTGCTGGC ATCAAAATAA CTTTTTTAAT ACCGGTATTT 1560 

GGTAGCTTGA ATCCCTTGCT CTTTAGAAAA TTGCGATTGG CATAGTAAAC ATCCACCGTA 1620 

TCTGTTAACT GATATTGCTG AATCTGTTCT GATTGGACAA AATTTTTTAC AGGAAGACTG 1680 

CTACTCTGCA CATAGCCCGC CTGCGTTTTT TCTACCAAAT CCTGATAAAA TCGATAGAAA 1740 

TAATCTGTAG ATTTCCCTGA CCCTGCTAGC TCTTCTTGCC ACAGATTATC ATTGAGTTTG 1800 

AAGGTTTCTA AGGTCAGGTA ATTACCTTGA CTTACCCACT GTTGCTGATA AGCAAGTTCT 1860 

TTGTTTTCTT GTTCTAAACT TCTGCCCACC CCAATCAGTA AGGCCGTCAG TAAAATAGTT 1920 

GTCCCTATTT TCATCACATA ATTGAAGATA AGACCAAATT TGAAAGATGA AAAACCTTTC 1980 

AGCAGAGAGC TGATTGTCAT TTTTTGGATT AAGAGGTAAG TCAACCAACT GATAAAGAGA 2040 

TAAAGCTGCA ACAGCAAAAA ATGAGACAAC CACAGCATAG GAAACAAATC TTTTGGCTTA 2100 

TAATCAAGCA AGAAAAACAC GCCTAGATTG ATCACAAGAG CCCCACCTAG GAGGAGGTAA 2160 

AGGTTGCCTT TTACAACATC AGCTAAAACA GCCCTATCTT GAAAACCAAG TAATTTTTGT 2220 

ACCCCAACTC TTTTCATCTC CATCATCGGT TGATACACTG TCACTAACAC AAGAAGCAAA 2280 

ATAGCCAAGA CAAAAACAAT GGCAGATAAA AGCAAATCTC GATTTATGAC TTCCACTGCA 2340 

CTTTTGTAGG TCGGCTCTAG CAAGGTAGCC TGGTCTATCT TGAAAAAATC GCTCCATTTC 2400 

TGTACAATCC TATCCTTGTC CATCTCTTGT GTAGAAGTTA TCGTATAGCG ACCATTTAAA 2460

CTACGAGATG TATCCTTGAT ATAGGTTTGA AAAGTCATAA GCTGAATAGG TTTGGCTTTT 2520 

AGAAAGGTCG GAATCGTACC AAGTTTATTG GAAATTTCTT TATTACTATA GACTCCTTCA 2580 

CCATCTGTGG TAAAATCAAG AGAAGAAATC CCAAACTCTT GGTAGGGGAA GGTATCTTTA 2640 

TCAAAAACAC CAGACTTGAC CACCTCATCA CCACTGTCTG TTTTGATGAT GGAGACTTTA 2700 

TACTCCTTTG ATACATCCTC AAAAAATCGA AGAACAGACG CTGCAGGTTC GTTAATATCT 2760

TTCAAATACA AATCCAAAGA ATCTACAGG 2789 
(2) INFORMATION FOR SEQ ID NO: 255:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2495 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 255: 

CTGCGAATTT TATTAAAGAT AATGTGTTAA TTACAGCGGC TCACAACTAC TACAGACATG 60

ACTATGGGAA AGAAGCGGAT GATATTTATG TTCTTCCGGC TGTTAGTCCA AGTCAAGAAC 120

CATTTGGAAA GATCAAAGTA AAGGAAGTTC GTTATTTGAA GGAATTTAGA AATTTAAATT 180 

CTAAGGATGC AAGGGAATAT GACTTGGCTT TATTAATTCT AGAAGAGCCC ATTGGTGCAA 240 

AATTAGGGAC TTTGGGTCTT CCTACTAGTC AAAAAAATTT GACAGGAATA ACTGTGACTA 300 

TCACAGGCTA TCCATCATAT AATTTTAAAA TTCATCAAAT GTATACAGAT AAGAAACAAG 360 

TTTTAAGTGA TGATGGCATG TTCTTGGATT ACCAAGTTGA TACTTTAGAG GGGTCTAGTG 420 

GATCTACAGT TTATGATGCT AGTCACCGTG TAGTAGGAGT GCATACTTTA GGAGATGGAG 480 

CTAATCAAAT TAACAGTGCA GTTAAATTAA ATGAACGAAA TTTGCCATTT ATTTAwTCGG 540 

TTCTTAAAGG TTACTCTCTT GAAGGATGGA AGAAAATAAA TGGTAGTTGG TACCATTATA 600 

GACAACATGA TAAACAAACG GGTTGGCAGG AGATAAATGA TACCTGGTAT TATTTAGACA 660 

GTTCCGGTAA GATGCTTACA GATTGGCAAA AAGTCCATGG AAAATGGTAT TATCTCAATT 720 

CAAATGGAGC AATGGTTACA GGTAGCCAAA CTATCGATGG TAAAGTTTAT AACTTCGCTT 780 

CATCTGGTGA GTGGATTTAA TGTTGGAGGA TATATAAAAT GAAGCTTTTG AAAAAAATGA 840 

TGCAAATCGC ACTAGCCACA TTTTTCTTCG GTTTGTTAGC GACAAATACA GTATTTGCAG 900 

ATGATTCTGA AGGATGGCAG TTTGTCCAAG AAAATGGTAG AACCTACTAC AAAAAGGGGG 960 

ATCTAAAAGA AACCTACTGG AGAGTGATAG ATGGGAAGTA CTATTATTTT GATCCTTTAT 1020 

CCGGAGAGAT GGTTGTCGGC TGGCAATATA TACCTGCTCC ACACAAGGGG GTTACGATTG 1080 

GTCCTTCTCC AAGAATAGAG ATTGCTCTTA GACCAGATTG GTTTTATTTT GGTCAAGATG 1140 

GTGTATTACA AGAATTTGTT GGCAAGCAAG TTTTAGAAGC AAAAACTGCT ACGAATACCA 1200 

ACAAACATCA TGGGGAAGAA TATGATAGCC AAGCAGAGAA ACGAGTCTAT TATTTTGAAG 1260 

ATCAGCGTAG TTATCATACT TTAAAAACTG GTTGGATTTA TGAAGAGGGT CATTGGTATT 1320 

ATTTACAGAA GGATGGTGGC TTTGATTCGC GCATCAACAG ATTGACGGTT GGAGAGCTAG 1380 

CACGTGGTTG GGTTAAGGAT TACCCTCTTA CGTATGATGA AGAGAAGCTA AAAGCAGCTC 1440 

CATGGTACTA TCTAAATCCA GCAACTGGCA TTATGCAAAC AGGTTGGCAA TATCTAGGTA 1500 

ATAGATGGTA CTACCTCCAT TCGTCAGGAG CTATGGCAAC TGGCTGGTAT AAGGAAGGCT 1560 

CAACTTGGTA CTATCTAGAT GCTGAAAATG GTGATATGAG AACTGGCTGG CAAAACCTTG 1620 

GGAACAAATG GTACTATCTC CGTTCATCAG GAGCTATGGC AACTGGTTGG TATCAGGAAA 1680 

GTTCGACTTG GTACTATCTA AATGCAAGTA ATGGAGATAT GAAAACAGGC TGGTTCCAAG 1740 

TCAATGGTAA CTGGTACTAT GCCTATGATT CAGGTGCTTT AGCTGTTAAT ACCACAGTAG 1800

GTGGTTACTA CTTAAACTAT AATGGTGAAT GGGTTAAGTA ATGAAGGCTA ATTGTAAACT 1860

GTGATGGATA CTTAACTTTG TATAATAGGT GGATAAAAGT CTTCACAATC AAAAAACGCA 1920 

TAGTATCAAG GTTTTTCTGT ACTGCCCTCA AACAGTTAGA CAATTAATTT ATCCGAAGgA 1980 

TTTAGTTCTG TATTGCACAG GGCTAAGTCC TTTTAGTTTT ACCTTAATTC GTTTATTGTT 2040 

GTAGTAATCA ATATAGTCTA TAATGGCTTG TTCCAATTGC TTAAGCGACT GAAACGACTT 2100 

CTCATAACCG TAAAACATTT CCGATTTCAG AATCCCAAAG AAGGACTCCA TCATACTATT 2160

GTCTGGGCTG TTTCCCTTAC GTGACATGGA TGCTTGAATT CCCTTACTCT CTAGGAACCG 2220 

ATGATAAGAA TCGTGTTGGT ATTGCCAGCC TTGGTCACTA TGGAGAATCG TATTCTCGTA 2280 

GTGCTTCTCT GTGAATGCCT GTTCCAACAT TGTTTGTACT TGTTCTAAGT TGGGTGAAGT 2340 

TGAAAGATTA TAGGCGATAA TTTCGCTATT AAAGCCATCT AAAACTGGTG ATAAGTAAAG 2400 

CTTTTGAGTA CTTGCTGGAA TGGCAAATTC TGTCACATCT GTGTAGCACT TTTCCATTGT 2460 

TTTAGAGCCT TCAAATTGGC CTTGAATGAG ATTCG 2495 
(2) INFORMATION FOR SEQ ID NO: 256: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 870 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 256:

TACCACCGTA TTCATCCAGC AAGATTGCCA TTTGTCTTTG GGTATTTCGC AGTTCTTTTA 60 

GCAAGTCATC CACAAAAATA GTTTCAGGTA CAAAAAGTGG ATCTTGTAAA ATTCTCTTCC 120 

AAACAATATT GTCAAAACCG TCCACAAAGC CTGCCTTAAG GAGACTCTTG GTGTGAATGA 180 

TTCCAATTAC ATTGTCCTTA TCCCCATCAT AAACCGGGAT ACGAGAATAA TTTTGTTTTA 240 

AAATACTTTG GATAATGGCT TGACTATCAT CCTGAATATC CACCATAAAG GCATCCGTTC 300 

GAGGAACCAT AACCTCTCGT GCCATCAGTT CATCGAGCGA AAAGACACCT TGTAGCATCT 360 

CAATCTCATC AGCATCCAAT GTTTCTTCAC TATTTGTCAG CATATAGGCA ATTTCATCAC 420 

GGGTCATCTT TTCATCCGCA TCATCGAATG ACATAGGAGT CAAATGGCTC AAGAAATTGG 480 

TCGAAGCAGC TAAAAGCCAA ACAAAAGGAC TGACTAGTTT TCCGATCCCA ATGATAATCG 540 

GCGCTGTACG AATTGCCAAG GCATCCTTTA GATTAAGAGC GATTCTCTTA GGATATAATT 600 

CCCCAAAAAC GATGGAAATA TAGGTCAAAA ATGCCAAGGA TAGAAAAGTT GCCACGGCTT 660 

GTGCTGTTTC GCCATTCCCA AGCCAAGAGG CAATCACACG TCCTAGAGTA TCAGTTAAAC 720

TCGCCCCTGA TAAGATTGTA ATCAGGGTGA TTCCTACCTG GATGGTTGAT AAAAAGTGGT 780

TAGGATTTTC TAGTACCTTC AGCAGGCGGA TGTAGCGTCT GTCTCCTTCT TCCGCCTTTT 840

GTTCAACTCG GGCACGATTA AGAGAAACGG 870
(2) INFORMATION FOR SEQ ID NO: 257:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1245 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 257:

CGTTCCCAGA AGCCCGCATT CTCATCGCCA ATGTCGTGAT TGATTTGGCC CTTTCTCCAA 60

AATCCAACTC AGCCTATGTA GCTATGGATA AGGCACTTGC TGACCTCAAA ACATCAGGGC 120

ACTTGCCTAT TCCGCGACAC CTGCGTGATG GGCACTACAG TGGAAGCAAG GAACTGGGGA 180

ATGCCCAAGA CTATCTCTAT CCACACAACT ATCCTGGAAA TTGGGTCAAG CAAGACTATC 240

TGCCAGAAAA AATTCGTAAT CATCACTATT TCCAAGCAGA AGATACTGGT AAATATGAAC 300

GGGCTTTGGC TCAAAGAAAG GAAGCTATCG ACCGTTTGCG AAAAATCTGA AATCCTTTTC 360

AAAAAATTGC ACTTTCCTCT TGATTTTTTT TGAAAAAGTG GTATCATATA AATATAGAAA 420

CGCTGTGGTG TACGACTTCA CACTTAAGTG TTGACCGACT ATTTTTTGTA TTATTAGGGA 480

AACAAAAGTC TTCTAACAGC ATGTAGGCCG TCTCACACGG AAACAGCTTC AGTTAGAGCG 540

AGTTGCCCAC CTGCTTAATT GCGCGGGTTC AATACAAACC GTGAAGTTTC GGCACCAATA 600

CAGCTTTTTT CTTTGCCTCC TTAGCTCAGC TGGCAGAGCA GCGGACTCTT AATCCGTGGG 660

TCACAGGTTC GATCCCTGTA GGGGGCATAT AAATACAACA GGAAAAGCCT TATAATATAG 720

GGCTTTTTTT GCTTTCCTTT TAAAAATTGT CGTGCAATTT GCCGTGTTTT TACAACAAAC 780

TTTTCACAGC CATAAACTCC TCACTAATTT TTTCCTCCAA GGTATGCCCA TAAACGTCAA 840

TCAACATGGA GATATCTTTA TGTCCTAAAA TTTGGCTCTT TGTCAACTGT AGTGGGTTGA 900

AGTCAGCTAA GCTCGAGAAA GGACAAATTT TGTCCTTTCT TTTTTGATAT TCAGAGCGAT 960

AAAAATCCGT TTTTTGAAGT TTTCAAAGTT CCGAAAACCA AAGGCATTGC GCTTGATAAG 1020

TTTGATGAGA TTATTGGTCG CTTCCAATTT GGCGTTAGAA TAGTGTAGTT GAAGGGCGTT 1080

GACGATTTTC TCTTTGTCCT TTAGAAAGGT TTTAAAGACA GTCTGAAAAA GAGGAGGAAC 1140

CTGCTTTAGA TTGTCCTCAA TGAGTCCGAA AAATTTCTCC GGTGCCTTAT TCTGAAAGTG 1200

AAACAGCAAG AGTTGATAGA GCTGATAGTG ATGTTTCAAG TCTTG 1245
(2) INFORMATION FOR SEQ ID NO: 258: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1684 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 258: 



ATGCCTATGT AACTCCACAT ATGACCCATA GCCACTGGAT TAAAAAAGAT AGTTTGTCTG 60

AAGCTGAGAG AGCGGCAcCC AGGCTTATGC TAAAGAGAAA GGTTTGACCC CTCCTTCGAC 120

AGACCATCAG GATTCAGGAA ATACTGAGGC AAAAGGAGCA GAAGCTATCT ACAACCGCGT 180

GAAAGCAGCT AAGAAGGTGC CACTTGATCG TATGCCTTAC AATCTTCAAT ATACTGTAGA 240

AGTCAAAAAC GGTAGTTTAA TCATACCTCA TTATGACCAT TACCATAACA TCAAATTTGA 300

GTGGTTTGAC GAAGGCCTTT ATGAGGCACC TAAGGGGTAT ACTCTTGAGG ATCTTTTGGC 360

GACTGTCAAG TACTATGTCG AACATCCAAA CGAACGTCCG CATTCAGATA ATGGTTTTGG 420

TAACGCTAGC GACCATGTTC AAAGAAACAA AAATGGTCAA GCTGATACCA ATCAAACGGA 480

CTCAGACAGA AAAACCTGAG [illegible] CTCGAGAAGA 540

GAAACCGCAA AGCGAGAAAC CAGAGTCTCC AAAACCAACA GAGGAACCAG AAGAATCACC 600

AGAGGAATCA GAAGAACCTC AGGTCGAGAC TGAAAAGGTT GAAGAAAAAC TGAGAGAGGC 660

TGAAGATTTA CTTGGAAAAA TCCAGGATCC AATTATCAAG TCCAATGCCA AAGAGACTCT 720

CACAGGATTA AAAAATAATT TACTATTTGG CACCCAGGAC AACAATACTA TTATGGCAGA 780

AGCTGAAAAA CTATTGGCTT TATTAAAGGA GAGTAAGTAA AGGTAGCAGC ATTTTCTAAC 840

TCCTAAAAAC AGGATAGGAG AACGGGAAAA CGAAAAATGA GAGCAGAATG TGAGTTCTAG 900

TTCTCATTTT TTTCATGAAA ATGTGCAAAA TATAGTAGAT TGAAACTAGA ATAGTATACC 960

TCTACTTCTA AAACATTGTT AGAAATCGAT TTGACTGTCC TGTTCTTATT TCATTTTACT 1020

ATATCTTAAC AGATAGTGTA AATAAAGATA AACTATTTAC TGGCTAATTA ATCAGTTAAA 1080

CACTAGTTAA GGAGTAATGA TGAAAAAAAG AACAATACTA TTATTGATGG CCAGTCTGTT 1140

AGCTCTTGTC TTAGGAGCAT GTGGTTTCTT GGACATATTG ATCCTGGATC ATTCTCATCA 1200

GGATTACTCT TTACTGCTAT TTTAGAAACT GGGGTGGTTT GATGGAAAGT ATTGGTCTTG 1260

TTATCGTTTC ACATTCCAAA CACATTGCAG AAGGTGTTGT TGAACTGATT AGTAAAGTAG 1320

CTAAAGATGT TCCGATTACT TATGTAAGAG GAACCGAGGG CGGAGGAATT GGAACGAGTT 1380

TTGAACAAGT AGATAGGGTT GTTTCCGAAA ATCCAGCAGA TACTTTACTT GCCTTTTTTG 1440

ACCTAGGTTC TGCTAAAATG AACTTAAAAA TGGTGACTGA TTTCAGTGAT AAAAGTATCA 1500 

TCATCAACAG GGTTCCAATT GTAGAAGGTG CCTATAATGC AGCTGCTCTT CTTCAGGCTG 1560 

GTGCAGAACT GTCAGTTATT CAAACACAGT TaGCGGAgCt TGAAATCAAT AAATAAGGAA 1620 

TTTTACTATA ACTCTTTTTA TAGATAAGCT ATTGaTTATC TCAACTATAA TAATGTTAAG 1680 

TnAA 1684 
(2) INFORMATION FOR SEQ ID NO: 259: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 970 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 259:



AGGAGTGGAG AnATATGAAG ACACAAATTT TCACATTATT GAAAATCGTT GCTGAGATTA 60

TTATTATTTT GCCATTTCTA ACTAATCTAT AAGTTCTTTA TATTGCTGAA AACGCAATTC 120

AAAAAGGGCT ATTAATTGTG GATTTTCTAA TACCTGCAGA GATTGGATAA AGCGTTCAAT 180

CTCTTTTTGA TTGCTTCCCT TTGTTTGAAG AAAGACACTC ATCTTCTTTA AAAATTGCCA 240

CGATACTTTT TCAAAAACAT CATACGGTCG TAACATCCTC TCCAACTCGG CTTCGAAGAT 300

TGGGATGTAG GAGAAAAGTT TTCGCTCCAT GAGTTCTGAT AAGATATTTA AGAGTCCTTG 360

CTTCATATAC AATCGATTGT GTACTAACTC TTTAAATTCT TTGGATTTTT CGAGTAAGGA 420

GGTTGATAAA AAAATCAGAT CTTGATTGCT CAAGAAGGGC ATGGTATTGC AAAAGAGATA 480

GAGTTCAAAC CAGGTCCAAG ACTCGATAGC ATAGAGATAG GTGGTCAAAA ACTCGCTATC 540

CTCCTCTGCT AGTGGGTAGC TTTTATTTAG TGAATGGATG GCATCTTTAA TCACGATGGC 600

ATTCAAACGA CGATAGGTCT GCGCCATCTG TTCTTGATCG ACTTCCTCCA ATAGCTGCTC 660

TAAAGCAGCT ATATCCTGAT GGGCAAAGCG ATTCACAACC TTTCGACCGA TTCGCATATG 720

TGGAGATTCT TGATAGTTGT TGAGCTTGTG CCCAAACTCA TCAAAGGTCA CATTTATACC 780

TTGGATAGCT AGAATCAACT TATCCGCAGA CAGCATAGAC TGCCCTAGTT CAAACTTGGA 840

CAACTGAGAA GCTGTTAGAC CCTCACAAGC CACATCTGAC TGCTTGAGCT TTCTCGCCAA 900

ACGTAATTCC TTGTAAAATT CCCCCAGTTC CATTCTCTCA ATCATCTGAC CACCTCCTAG 960

CTTTTGCAGG 970
(2) INFORMATION FOR SEQ ID NO: 260:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2996 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 260:






GTTGACCACG GGTAAAACTA CCCTAACTGC AGCTATCACA ACTGTTTTGG CACGTCGCTT 60

GCCTTCATCA GTTAACCAAC CTAAAGACTA TGCGTCTATC GATGCTGCTC CAGAAGAACG 120

CGAACGCGGT ATCACTATCA ACACTGCGCA CGTTGAGTAC GAAACTGAAA AACGTCACTA 180

CGCTCACATC GACGCTCCAG GACACGCGGA CTACGTTAAA AACATGATCA CTGGTGCTGC 240

TCAAATGGAC GGAGCTATCC TTGTAGTAGC TTCAACTGAC GGACCAATGC CACAAACTCG 300

TGAGCAOATP CTTCTTTCAC GTCAGGTTGG TGTTAAACAC CTTATCGTCT TCATGAACAA 360

AGTTGAPTTfi GTTGACGACG AAGAATTGCT TGAATTGGTT GAAATGGAAA TCCGTGACCT 420

[illegible] TACGACTTCC CAGGTGACGA TCTTCCAGTT ATCCAAGGTT CAGCACTTAA 480

[illegible] GGTGACTCTA AATACGAAGA CATCGTTATG GAATTGATGA ACACAGTTGA 540

TGAGTATATC CCAGAACCAG AACGTGACAC TGACAAACCA TTGCTTCTTC CAGTCGAGGA 600

CGTATTCTCA ATCACTGGAC GTGGTACAGT TGCTTCAGGA CGTATCGACC GTGGTATCGT 660

1 AAACsTCAAC GACGAAATCG AAATCGTTGG TATCAAAGAA GAAACTCAAA AAGCAGTTGT 720

TACTGGTGTT GAAATGTTCC GTAAACAACT TGACGAAGGT CTTGCTGGAG ATAACGTAGG 780

TGTCCTTCTT CGTGGTGTTC AACGTGATGA AATCGAACGT GGACAAGTTA TCGCTAAACC 840

AGGTTCAATC AACCCACACA CTAAATTCAA AGGTGAAGTC TACATCCTTA CTAAAGAAGA 900

AGGTGGACGT CACACTCCAT TCTTCAACAA CTACCGTCCA CAATTCTACT TCCGTACTAC 960

TGACGTTACA GGTTCAATCG AACTTCCAGC AGGTACTGAA ATGGTAATGC CTGGTGATAA 1020

CGTGACAATC GACGTTGAGT TGATTCACCC AATCGCCGTA GAACAAGGTA CTACATTCTC 1080

TATCCGTGAG GGTGGACGTA CTGTTGGTTC AGGTATGGTT ACAGAAATCG AAGCTTAATT 1140

CGATTTAGTT CCCAGAAGAA CAATTATTTA AGTTAGACAC TAAAAGAATC TTGCTTGGCA 1200

AGGTTCTTTT TTTAGATATT GAACTAATAC TCAATGAAAA TCAAAGAGCA AACTATAATA 1260

TATTGAAACT AGAATAGTAC ACATCTACTT CTAAAACATT GTTAGAAATC GATTTGACTG 1320

TCCTGATCGA TTTGTCTTGT TCTTATTTCA TTTTACTATA GAAAGTTAGC TACAGACTGC 1380

TCAAAACATT GTTTTTAGGT TGTAGATAGA ACTGACGAAG TCAGtAACAT CTATACGACA 1440

AGGCGAAGCT GACGCGGTTT GAAGAGATTT TCGAAGAGTA TAATACTAGA CTAAAATCAA 1500

AAAGCATTAT ACAATAGTAA TATGAAATCA ATTAAAGAAG AAATCCAAAC CATCAAAACA 1560

CTTTTAAAAG ACTCTCGTAC AGCTAAATAT CATAAACGCC TTCAAATCGT TCTATTTCGT 1620

CTGATGGGCA AATCTTATAA AGAGATTATA GAACTTTTAT AGTGGTTTGA AATAAGATGT 1680

GAACAACTCT ATCAGGAAAG TCAAACTAAT TTATAGAAAT ATTTTAGCAG CCAAGGTGTA 1740

CTGTTATAGA TTCAATACAC TTTAGACTGT AATCAAACAA CGATTTGGCG AAATGTAAAA 1800

AATATGAGGA GTTCGGACTC GACTCTCTCC TTCAAGAAAC ACGTGGTGGT CGTAACCATG 1860

CTTATATGAC GGTTGAGCAA GAGAAAGTCT TTCTTGCCCG CCATTTGAAG GCTACAGAGG 1920

CAGGAGAATT TGTTACAATT GATGCCTTAT TTCAGGCTTA TAAAAAGGAG TTAGGTCGTT 1980

CCTACACACG TGATGCCTTC TATCAACTGT TGAAGCGCCA TGGTTGGCGA AATATTACGC 2040

CACGTCCAGA ACATCCTAAG AAAGCAGATG CTCAAACCAT TGTCGCGTCT AAAAATAAAG 2100

TCTCAATTCA AGAAGACAAG TGAACTGCAC CCCAAAAGTT AGACAGAAAA AATCTAACTT 2160

TTGGGGTGTT TTTATTATGA AATTAACTTA TGATGATAAA GTTCAGATCT ATGAACTTAG 2220

AAAACAAGGA TATAGCTTAG AGAAGCTTTC AAATAAATTT GGGATAAACA ATTCTAATCT 2280

TAGGTACATG ATTAAATTGA TTGATCGTTA CGGAATAGAG TTCGTCAAAA AAGGAAAAAA 2340

TCGTTACTAT TCTCCTGATT TAAAACAAGA AATGATTCAT AAAGTCTGAC ATGAAGGCTG 2400

GACTAAAGAT AGAGTTTCTC TTGAATACTG TCTCCCAAGT CGTACGATAC TTCTTAACTG 2460

GCTAGCACAA TACAGGAAAA ACGGGTATAC TATTGTTGAG AAAACAAGAG GGAGAGTACC 2520

TGAGAGCGGA GAATGCCATC CTAAAAAAGT TAAGAGAACT CCGATTGAAG GAGGAAAAAG 2580

AGAAAGAAGA AAGACAGAAA TTATTCAAGA ATTAATGACT GAGTTTTCGT TAGATATTCT 2640

TCTAAAAGCC ATTAAACTAG CTCGTTTGAC CTACTACTAT CACTTGAAAC AGCTAGATAA 2700

ACCAGATAAG GACCAAGAGC TTAAAGCTGA AATTCAATCC ATTTTTATCG AACACAAGGG 2760

AAATTATGCT TATCGTCGGA TTTATTTAGA ACTAAGAAAT CGTGGTTATC TGGTAAATCA 2820

TAAAAGAGTT CAAGGCTTGA TAAAAGTACT CAATTTACAA GCTAAAATGC GACAGAAACG 2880

AAAATATTCT TCTCATAAAG GAGACGTTGG CAAGAAGGCA GAGAATCTCA TTCAAGGACA 2940

ATTTGAAGGC TCTAAAACAA TGGAAAAGTG CTACACAGAT GTGACAGAAT TTGCCG 2996


(2) INFORMATION FOR SEQ ID NO: 261:









(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 837 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 261: 

CTTATCAACT CCCGACATGG CTCTCAGACC AATCCAAATC CCTAAAAAAA TCAGAACAAG 60 

GATGGTGGTC AAGATCAAAC TCTCGAAATA TAAAGAAAAT AGTTGCAGTA GCATGATTTC 120 

TCTCATTTCT ATCTTTTTTA AAGAGTAAAC TCAGCTAGTC CAACTAACTG AGTTTTCCTT 180 

TATCTATTAT ATCAAATATA AGTCCGTTTG TAACTAGCGA AGAATTCTTT TGTCCGCTCT 240 

TCTTTAGGGG TGTGGATAAT CTCATCCGGA GTTCCAGACT CGATGATTTT CCCCTTATCT 300 

AAGAAGAGAA TTTTATCCGC AACTTGGGCT ACAAAGGACA TGTCATGACT GACCAAAATC 360 

ATGGTCTGAC CTGACTTAGC AGCATCTGCA ATAGACTTTT CTACTTCACC GACCAATTCT 420 

GGGTCAAGGG CTGAAGTTGG TTCGTCTAAG AGCAAAACAT CTGGTTTCAT AGCAAGCGCA 480 

CGCGCTAGGG CAACCCGTTG CTTCTGTCCA CCTGATAAAT GGCGAGGATA ATGGTTTTCA 540 

CGGTCCGAAA GCCCAACCTT AGCCAACTCT TCCTTGGCAA TCTTAGTCGC TTCTTGGTCA 600 

GATAATTTCT TGACAACAAC CAAGCCTTCT TTCACATTAT CAAGTGCTGT TCGGCGTTCA 660 

AACAAATTAA ACTGTTGGAA AACCATAGAC AACTTACGAC GTAGGGCAAG GATTTCTTCT 720 

TGAGTGATTT TAGAAAAATC AACTGAAAAA CCATCAATCT GAATAGAGCC ACTGTCAGGT 780 

GTTTCTAGAT AATTGAGACT GCGAGAAAGG TTGATTTTCA GCTCTGAAGA CCAATCA 837 
(2) INFORMATION FOR SEQ ID NO: 262:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 868 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 262: 

CCGAACAAAA TGGGCTAATT AGATTATAGT AAGAAAGGTA AGTTAAAAAT GAGAATTGCA 60 

ATTGGATGTG ACCACATCGT AACTGATGAA AAAATGGCGG TTTCAGAATT TTTGAAATCA 120 

AAAGGATATG AAGTCATTGA CTTTGGTACC TATGACCATA CACGGACTCA CTACCCAATC 180 

TTTGGTAAAA AAGTAGGGGA AGCTGTAACT AGCGGTCAAG CTGATCTTGG AGTATGTATC 240 

TGTGGTACTG GTGTTGGTAT CAACAACGCT GTAAATAAAG TTCCAGGTGT TCGTTCTGCC 300 

TTGGTTCGTG ATATGACAAC AGCCCTTTAT GCTAAAGAAC AATTGAACGC TAACGTTATT 360 

GGTTTTGGTG GTAAAATTAC TGGTGAATTG CTTATGTGTG ATATCATCGA AGCTTTCATC 420

CATGCTGAAT ACAAACCAAC TGAAGAAAAC AAAAAATTGA TTGCGAAAAT TGAACATGTT 480

GAAAGTCACA ATGCTCAACA AACAGACGCA AACTTCTTTA CAGAATTCCT TGAGAAATGG 540 

GATCGTGGAG AATACCACGA CTAAGAGGTG ACCTATGATT TTAACAGTCA CAATGAACCC 600 

ATCCATCGAT ATTTCCTATC CCTTGGATGA GTTGAAGATT GATACTGTCA ATCGTGTGGT 660 

GGATGTAACC AAAACGGCTG GTGGTAAGGG ACTCAATGTT ACCCGAGTAC TTTCAGAATT 720 

TGGCGATTCT GTTCTTGCTA CTGGTTTAGT GGGTGGCAAA CTTGGTGAGT TTTTGGTTGA 780 

ACATATCGAT AATCAAGTAA AGAAAGATTT CTTCTCAATT AAGGGAGAAA CTCGTAACTG 840 

TATCGCTATT CTCCACGGAG ACAACCAA 868 
(2) INFORMATION FOR SEQ ID NO: 263: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3744 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 263: 

CCGTTCAAAG TCTTCATAAG ACTCGAAAGT CACAGTTCTT TCGTTCTTGC TGGCATCTAT 60 

ATAGGTAATT TCAATCATGT TTAAAACTCC TTTGTTTAAT GCTAACTTTA TTTTACTCCT 120 

TATAAAAGAG AATGTCAAGA AAAATGATTG CGCACGCAAC TTTTTTTAAA ATCATCTTAA 180 

ATCAAGAAAT CCAAACCTGC TTCCAAGCTT TCTTCGACAG TCTTTTGTAG CGAGGCCAGT 240 

GTCTTTTGCC CATCATTTGT CAGGCAGATA AAACTAGAGC GTCTATCTTG ATGGCAACAC 300 

ATGCGACTGA GTAGACCGCA ATTTTTAGCT TCCAAGCGAG CCACCATCCT AGAAACTGCG 360 

CTCGGGCTCA GATGAAGCTT ATCTGGCAGG TCAATCTGGC GTAGAGATTT TTCTTCAGCC 420 

AAGTCCAGAT AGTAGAGCAG GTAGAACTCT TTCAAGGTCA GACTTTGCTC GCTCTGTTGG 480 

GCAATGGTCT CTTCCAAGAG ACTTTCAATT TCTTTCTGAC GCCGATTGAA GTCAAACCAT 540 

TTTTCCAAAT AGGTCATAGT GTCTCCTTTC TTTTTAGAGT CATAAATAGA AGAAAGTCCA 600 

TTAACGGGCA GTCTCTGCGT CACAAGATGA TTGCGCATGC AATAATTATA CTACTTTTCA 660 

AGAATGCTGG CAAGCTCTGT TTTTTAGTGG TTTTATTTTT GTGTGAATAA TGGGGGAATC 720 

CTATTGTTTC AATTTCTAAC TCCTTATCAC ATTCGAATTC AGATTTTATT TCATTTCTCT 780 

ATCTATAGTT GCTTAGTTTA AAATAAGCAT GGTCTAATAA AGCTATGCAT ATAGTACTGA 840 

TTTTAAACAA GGAGCATTAG ATTCCATTAA AGGAGGGCAC AGACATGTCG AGGCGGCCAA 900 
AGTTTTTGAT GTCGGCGTCA GAACTCTCTT CACGTGGGAA AAGAAAGACG TAAACAAGGG 960

AACTTAGAGC GGAAAAAGCG AGTCGTCAAA AAGCGTAAGA TCCCTTTAGA AGAATTGAAA 1020


GCCTTTGTAG AGGCTCATCC AGACGCTTTT TTACGGGAAA TTGCGGCCCG X X X X X X W X 1080

GCTTTGCCCT CCGTATGGGC AGTTTTAAAG CAGATTAAGG TCATTTTAAA AAAGACGACC 1140

AGTTTTAGGG AACAAAAGCC TGAGAAAGTT TCTGAGTTTC TTGATATTTT GGATAAPPTA 1200

AAAGATTTAC CAGTCCTATA TATTGACGAA ACGGGAATCG ACCGCTACCT PTATPfSTPPT 1260

TATGCAGGGG CTCCTAGAGG GGAGAAAGTC TATGGCAAGA TTAGCGGACG GCGTTTTGAf! x x x x vvrvvj 1320

CGGACTAATG AGGTGGAGCA AAAACTCAAT GGTAGTTTTC TAATCAGATA TATTGATTCA 1380

CAAATTAGAG AATGAAAGAA TAATTATGCA TAAAAATAGG AATATAAACC AAAAATTAGP 1440

TGATTTATAC TCATTTGCGT GTCTTTATAA AAAACTTATC TTATAATATA TATATATATA 1500

TATACAAAAT AGTAAAATGC TTTTTTTTTT TAGCAAAAAT APPTP A AfiTT TPTTPJ PT ITT 1560

TTGGGTTCCC TATTCTATAA TTATAGTATG GTAATTTATT TATATPPATA PaTY^aa a »fp & 1620

ATACTCGAAA GGAAATTTCA AAATATTTTT TAfiAPfiTPAf: AAGGCTGAAT A T & f2 & fZ A & a P 1680

AGACCGAGTA ACTCGGTTCA AATTAATCAA ATCAGGGAAG 1740

TTCTCTTTTT GGCTTGTTTA AGGTCTTGCG AGGTGGTGTT uAx i Av» L L. Avj(j I t, A 1 oAL 1800

CGAAACGGTA GAAGATAAAG TAAGTCATTC AATTACTGGG Ux IvnlnlLL IXAAIjvjGwVT 1860

AGTTGCTGCG GGAGCTGTCA TAAGTGGAAC CGTTGCAACT p a a a r*n a a nn 1A1 1 xAt_AAA 1920

TGAGTCAGCA GTACTTGAAA AAACTGTAGA GAAAACGGAT RPTwPTVlftP A A VJVw 1 X 1 VjVjV_/\/\ V. AAA 1 un 1 Av. 1980

AGTAGTTCTA GGTACGATAT CTACAAGTAA TTCAGCGAGT TP.AAPTAfi'PT TRTPAfiPTTP 2040

AGAGTCGGCA AGTACATCTG CATCTGAGTC AGCCTCAACC AGPOPVPPfiA \_>v~ x ^nuvnnu 2100

TACAAGTGCA TCAGAATCAG CAAGTACATC GGCTTCGACA AGTATTTCTG PATPATPTAP 2160

TGTGGTAGGT TCACAAACAG CTGCCGCTAC AGAAGCAACT GCTAAGAAGG TCGAAGAAGA 2220

A ^-vj 1 AAuAAA CCAGCTAGTG ATTATGTAGC ATCAGTTACA AATGTCAATC TCCAATCTTA 2280


TGCTAAGCGA CGCAAGCGTT CAGTGGATTC CATCGAGCAA TTGCTGGCTT CTATAAAAAA 2340

TGCTGCTGTT TTTTCTGGCA ATACGATTGT AAATGGCGCC CCTGCAATTA ATGCAAGTCT 2400

AAACATTGCT AAAAGTGAGA CAAAAGTTTA TACAGGTGAA GGTGTAGATT CGGTATATCG 2460

TGTTCCAATT TACTATAAAT TGAAAGTGAC AAATGATGGT TCAAAATTGA CCTTTACCTA 2520

TACGGTTACG TATGTGAATC CTAAAACAAA TGATCTTGGT AATATATCAA GTATGCGTCC 2580

TGGATATTCT ATCTATAATT CAGGTACTTC AACACAAACA ATGTTAACCC TTGGCAGTGA 2640

TCTTGGTAAA CCTTCAGGTG TAAAGAACTA CATTACTGAC AAAAATGGTA GACAGGTTCT 2700

ATCCTATAAT ACATCTACAA TGACGACGCA GGGTAGTGGG TATACTTGGG GAAATGGTGC 2760

CCAAATGAAT GGTTTCTTTG CTAAGAAAGG ATATGGATTA ACATCATCTT GGACTGTACC 2820 

AATTACTGGA ACGGATACAT CCTTTACATT TACCCCTTAC GCTGCTAGAA CAGATAGAAT 2880 

TGGAATTAAC TACTTCAATG GTGGAGGAAA GGTAGTTGAA TCTAGCACGA CCAGTCAGTC 2940 

ACTTTCACAG TCTAAGTCAC TCTCAGTAAG TGCTAGTCAA AGCGCCTCAG CTTCAGCATC 3000 

AACAAGTGCG TCGGCTTCAG CATCAACCAG TGCCTCGGCT TCAGCGTCAA CCAGTGCGTC 3060 

AGCTTCAGCA AGTACCAGTG CTTCAGTCTC AGCATCAACA AGTGCTTCAG CCTCAGCATC 3120 

GACAAGTGCC TCGGCTTCAG CAAGCACATC AGCATCTGAA TCAGCGTCAA CCAGTGCTTC 3180 

GGCTTCAGCA AGTACCAGTG CTTCAGCTTC AGCATCAACC AGCGCCTCGG CCTCAGCAAG 3240 

CACCTCAGCT TCTGAATCGG CCTCAACCAG CGCCTCGGCC TCAGCAAGCA CCTCAGCTTC 3300

TGAATCGGCC TCAACCAGCG CCTCAGCCTC AGCATCAACG AGTGCTTCGG CTTCAGCAAG 3360 

CACAAGCGCC TCGGGTTCAG CATCAACGAG TACGTCAGCT TCAGCGTCAA CCAGTGCTTC 3420 

AGCCTCAGCA TCAACAAGTG CGTCAGCTCA GCAAGTATCT CAGCGTCTGA ATCGGCATCA 3480 

ACGAGTGCGT CTGAGTCAGC ATCAACGAGT ACGTCAGCCT CAGCAAGCAC CTCAGCTTCT 3540 

GAATCGGCCT CAACCAGTGC GTCACCTCAG CATCGACAAG CGCCTCAGCT TCAGCAAGTA 3600

CCAGTGCTTC AGCCTCAGCG TCGACAAGTG CGTCGGCCTC AACCAGTGCA TCTGAATCGG 3660

CATCAACCAG TGCGTCAGCC TCAGCAAGTA CTAGTGCATC GGCTTCAGCA TCAACCAGTG 3720 

CCTCGGCTTC AGCGTCAAAC AGTG 3744 
(2) INFORMATION FOR SEQ ID NO: 264: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 795 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 264: 

CGATAAAGAG GCCTTGAGTA ATCTCAATTT GCAGATTGAA AATGGAGAGA TTATGGGCTT 60 

GATTGGTCAT AATGGGGCTG GAAAATCGAC CACTATAAAA TCCCTAGTCA GTATCATTTC 120 

ACCCAGCAGT GGTCGTATTT TGGTAGACGG TCAGGAGTTA TCGGAAAATC GCTTGGCTAT 180 

TAAACGAAAG ATTGGCTACG TAGCAGACTC GCCTGACTTA TTTTTACGCT TAACGGCCAA 240 

TGAATTTTGG GAATTGATCG CCTCATCCTA TGATCTGAGT AGATCTGACT TGGAGGCTAG 300 
TCTAGCTAGG CTATTGAACG TTTTTGATTT TGCTGAAAAT CGCTATCAGG TTATTGAAAC 360

TCTTTCTCAC GGAATGCGTC AGAAAGTCTT TGTCATCGGA GCACTCTTGT CTGATCCCGA 420 

TATTTGGGTC TTGGATGAAC CCTTGACTGG TTTGGATCCC CAGGCTGCCT TTGATTTGAA 480 

ACAGATGATG AAGGAACATG CACAAAAAGG GAAGACAGTC TTGTTTTCAA CTCATGTCCT 540 

AGAGGTGGCA GAGCAAGTCT GTGATCGGAT TGCCATTTTG AAAAAGGGGC ATTTGATTTA 600 

TTGTGGTAGT GTAGAGGACT TGAGAAAAGA TTACCCAGAC CAGTCTTTGG AAAGTATCTA 660 

CCTTAGTCTT GCTGGTAGAA AAGAGGAGGT TGCGGATGCG TCTCAAGGTC ATTAAAAAAT 720 

TAGTTGATAT CAATATCCTT TATTCATCTC AAGAAGCTAA TCTGGCTAAT CTACGAAAGA 780 

AGCAGGCTAA GAATC 795 



(2) INFORMATION FOR SEQ ID NO: 265: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2231 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 265: 

TGGTAATGTG CTTGGCAGCw TCCTTGACAC TGCTACTACC ATTTCCCATA GCGACCGACA 60 

TACCAACGCC AGCCAGCATT TCAAGATCAT TATCTGAGTC ACCAAAAGCC ATGACTTGGT 120 

TGAGGTCAAA GCCATATTCT TTCCCAACTC GGCGAATGCC TTCTAATTTA GAATTTCCCT 180 

GATTGATGAC ATCCGATGCA AAAGGATTGC TACGTGTCAA TTTCAAGTCT TCAAAATCAG 240 

CTGCCGCCTT CTCAGATTCT TCTGGTGTCA TCAGCATCAA AACTTGGTAG ATAGGCTGAT 300 

TCATCAGGTG AAGCAGGTCC TCTTCCTTTT GGGGAACAAC CTTGCTGACC ATGCGATTAA 360 

AAGACTGACT CACCGTCCGA GTTAAAACAG AGGGAACGAA GCGACTAATT CGTTGGGAAA 420 

AAGAACCCAG ACCAAAGGAC ATGATTTTAG AACCCAACAT GGCATCCTTG GTCCCTAG^G 480 

CAATCTCCGT GCCCTCTTTT TTAGCATAGC TAATTAGATG GCGCAAATGT AACTTGGAAA 540 

TAGGGCTCGT GAACAAGACT CTGTCTTTAC TAAAGATATA CTGGCCATTA TAGGTTACCG 600 

CAAAATCCAG ATCCAAATCG TCCATCAATT CCTTAACAAA AAAAGGTCCT CGCCCTGTCG 660 

CTACGCCAAC TAGTACCCCT TGTTCTTTGA CAATCTTAAT CGCATCCTTA GTGGATTTCA 720 

AAACACTCTT GCGATTGTTG ACCAAGGTTC CATCGATATC AAAAAAAACA GCTTTGACTT 780 

CCATCCTATC CCAATCTCCC CTTTTGTGAT ACAATGATTA TACCACATTT CAGAAAGAGT 840 

GAGTAAATCA TGCCTAAGAA AATCCTTGTT TTACATACGG GTGGAACTAT TTCCATGCAG 900 
GCCGATGCTT CTGGCGCTGT TGTGACGAGT TCAGATAATC CCATGAACCA TGTGTCCAAC 960

CCACTTGAAG GAATCCAAGT CCACGCCTTG GACTTTTTTA ACCTTCCAAG TCCCCATATC 1020

AAACCCAAAC ATATGCTGGT CCTCTACCAG AAAATTAAAG AGGAAGCAGA TAACTACGAT 1080 

GGAGTGGTGA TCACACACGG AACCGATACT TTAGAGGAAA CAGCCTATTT CCTTGATACC 1140 

ATGGAAGTTC CCCATATGCC TATCGTTCTA ACAGGAGCCA TGCGTACtCC AATGAGCTCG 1200 

GTAGTGATGG TGTTTATAAT TACCTAAGTG CTTTACGAGT GGCCAGCGAT GACAGGGCTG 1260 

CTGACAAAGG AGTTTTGGTC GTTATGAACG ATGAAATCCA CGCTGCCAAG TATGTCACCA 1320 

AAACACATAC GACTAATGTC AGCACCTTCC AGACTCCAAC ACATGGCCCC CTTGGTCTCA 1380 

TCATGAAACA GGAAATCCTC TACTTCAAAA CAGCTGAACC TCGTGTTCGC TTTGACCTTG 1440 

ATCACATACA AGGTTTAGTC CCTATCATCT CGGCTTATGC TGGTATGACA GATGAGCTGA 1500 

TTGATATGCT GGATTTAGAA CACTTGGACG GTTTGATTAT CCAAGCCTTC GGAGCTGGTA 1560 

ATATTCCCAA AGAAACGGCT CAAAAATTAG AAAGCCTTCT GCAAAAAGGA ATTCCAGTCG 1620 

CTCTGGTATC ACGATGCTTT AACGGTATTG CCGAGCCTGT TTATGCATAC CAGGGTGGGG 1680 

GCGTACAGTT GCAAAAAGCA GGCGTTTTCT TTGTTAAAGA ACTCAACGCC CAAAAAGCTC 1740 

GCTTGAAACT CCTCATCGCC CTCAATGCCG GACTAACAGG ACAGGCTTTG AAAGACTATA 1800 

TGGAAGGCTA ATACTCTTCG AAAATCTCTG CAAACCACGT CACGTCGCCT TACCGTATGT 1860 

ATGGtACTGA CTTCGTCAGT TTCATCTACA ACCTCAAAAA CATGTTTTGA GCTGACTTCG 1920 

TCAGTTCTAT CTACAACCTC AAAAACATGT TTTGAGCTGA CTTCGTCAGT TCTATCTACA 1980 

ACCTCAAAAA CATGTTTTGA GCTGACTTCG TCAGTTCTAT CTACAACCTC AAAAACATGT 2040 

TTTGAGCTGA CTTCGTCAGT TCTATCTACA ACCTCAAAAA CATGTTTTGA GCTGACTTCG 2100 

TCAGTTCTAT CTACAACCTC AAAAACATGT TTTGAGCTGA CTTCGTCAGk TCTATCTACA 2160 

ACCTCAAAAA CATGTTTTGA GCTGACTTCG TTAGTTTCAT CTACAACCTC AAAAACATGT 2220 

TTTGAGCTGA C 2231 
(2) INFORMATION FOR SEQ ID NO: 266: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1310 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 266: 
GAGTCAAAGG CTCCGAGGTT GACTTTTTAC AAGGGGACAG GTGAATATTA TCTAGACCTG 60

TCAGAAATTC TCTTCTTTGA AACAGAAGGG AGCAAGATCT ACGCTCATAA CCAGAAGGAA 120

GCTTATGAGG TTCGCCTCAA GCTCTATGAG TTGGAGTCTA TCTTGCCTCG CTATTTTAAT 180


CGAGTTTCCA AGTCAACGAT CGCAAACATC CGTCAGATTT 


At_ 1 LAbTGGA 


CAAGTCCTTT 


240 


TCAGGAACGG 


\jv-.n\^\_n uiL U. 1 1 1 1 A AL.GvJALAAGG 


AGGTTCATGT 


CTCACGGCAT 


300 


TACCAATCCC 




A A A TV X A A m/> 

AAAAAACATG 


AAAAAGAAAG 


360 


CATTTGGTAT 


TGTTTTATTfi <^T w PT w rar2r , ivr» rnuiwitTirmm 


GCTGCAAGGG 


AATTTTGGAA 


420 


TTCCTTCTTT 


wj« i i /u\a A J. A 1u(jH- 11 I At- 1 AGG 1 AT 


TGTTTTTTTT 


GCTTATAAGT 


480 


CCATTGAGTC 


u/t n_ i_ i i AtaA Lb l l_ ATCTC A CTTCGGC AGT 


TTTTACAGGT 


TTACTGGCGC 


540 


TCATCATTGC 


"™ * 1 i i A IajAL 1 Ivj 1 1 At_(_ AGTTAC 


CAATCATTCT 


CTTATTTGGG 


600 




GGTGGTACTT GGTGTTGGTT ATCTGACGCA 


TTCAAGTAAG 


TTCTGGAATG 


660 


AAAAAAAATG GTGGTACAAT GGGAAAAAAA cagtcgtcac GGATAAGGAA GTCGCTTTTG 720

GTAGCGGGAC CTTCTATAAG CAAGATCAAG ATCTCGTAGA TGACCAAGTG GAAGTCGCTT 780

TTGGGGATGC TAAAATCTAC TATGATAATG cagagatgct AGGTGATTTT GCAACTTTAA 840

ATATTGAAGT GGCCTTCGGG AATGCAACCG TCTATGTTCC ACAACACTGG CGTGTAGATT 900

TGAAAGTAGA AACCTCCTTT GGTGCAGCTA AGGCTGACGC TCCTGTAGCC CCAACCAGCA 960

AAACCTTGAT TATCCGTGGA GATGTGGCTT TTGGGAAGTT GGAAATTGTC TACGTTAAAT 1020

AAAAAAATCT TCACTTCAAC CATCAAAATA GACGTACTAA GAGTAGGAAA TTGATGCCTT 1080

GCTCTGATTT CAGTTCTATG GTTGTTAGAC TTTAAAAAAT GAAATGCTGC CTTTAAAAGT 1140

TGTATATTTT TCGATATTTT GGCTTTTACG TTTGATGTAT CTATGTACTA CAGCGTAGAT 1200

GATGTAGTGT CAAATGCTTT TAAAAAACGG ATGATATTGG ACAGTTTTTT TGCCTTTAAT 1260

TGCTCAGGAA CCATGAAAGT CAGTACCTGG GTTTATGACA AGGGAGAATG 1310


(2) INFORMATION FOR SEQ ID NO: 267: 









(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5922 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 267: 
ACTCTGATTT GATTGGAACG ACAGTCGGTG CCATTGCAGT TACTTCAAAC GTAACGACTT 60

ATGTTGAGTC TGCTGCTGGT ATCGGTGCAG GTGGACGTAC TGGTTTGACA GCCTTGGTTG 120

TAGCTATCTG TTTTGCGATT TCAAGCTTCT TTAGCCCACT TCTAGCGATC GTACCAACAG 180

CGGCTACAGC TCCAATCTTG ATTATCGTTG GGATTATGAT GCTTGGTAGC TTGAAAAATA 240

TCCATTGGGA TGATATGTCT GAAGCAGTTC CTGCCTTCTT CACATCTATC TTTATGGGAT 300

TCAGCTACTC TATCACTCAA GGGATTGCAG TTGGTTTCTT GACTTACACT TTGACTAAGC 360

TTGTTAAAGG TCAAGTTAAA GATGTTCATG TCATGATTTG GATTTTGGAT GCCTTGTTTA 420

TGCTTAACTA CATCAGCATG GCCTTATAAT AGAATGACCC AGGGGGATTT CCCCCCTTTT 480

TTAATACAaG GAGATAGGTG ATGAAAGAGA AAAATATGTG GAAAGAATTG TTGAATCGTG 540

CAGGCTGGAT TTTGGTCTTT TTACTTGCCG TCCTTTTATA TCAGGTTCCC CTAGTGGTTA 600

CCTCTATTTT GACTTTAAAA GAAGTAGCCC TGCTACAGTC AGGGCTGATA GTTGCTGGCC 660

TTTCAATTGT GGTTCTGGCT CTATTTATTA TGGGAGCTCG TAAAACCAAG TTAGCTAGTT 720

TTAATTTTTC TTTTTTTAGA GCTAAAGATT TGGCACGTTT GGGCTTGAGT TATCTAGTTA 780

TTGTCGGGTC AAATATACTT GGTTCCATTT TATTGCAACT GTCAAATGAG ACGACAACAG 840

CTAACCAGTC TCAGATTAAT GATATGGTTC AAAATAGTTC GTTGATTTCC AGTTTCTTCT 900

TGCTAGCCTT GCTTGCTCCG ATTTGTGAGG AAATCTTGTG TCGTGGGATT GTTCCTAAAA 960


AGATTTTCCG AGGCAAGGAG AACTTGGGAT TTGTAGTCGG TACGATTGTG TTTGCTTTAT 1020

TGCATCAACC AAGTAATTTA CCTTCTTTAT TGATTTATGG AGGTATGTCG ACAGTTCTAT 1080

CTTGGACAGC CTACAAGACC CAACGTTTGG AAATGTCGAT CTTGCTTCAC ATGATTGTTA 1140

ATGGGATTGC TTTCTGTTTG TTGGCTCTTG TGGTGATTAT GAGTCGGACA TTAGGAATTT 1200

CTGTTTAAAA GTTTTTATGT AGGAACCGAC CTCTTTCTAC CAGGGAAAGA TGAATGCAAT 1260

CGTGTCCATC TTTTTCTTTT TATGGTAAAA TAGAAAAATA ATATGATGAA AATCCTTGAG 1320

GGAGTGACCG ATATGTCAAG TAAAGCCAAT CATGCAAAGA CAGTTATTTG CGGAATTATC 1380

AATGTAACCC CAGACTCCTT TTCGGACGGT GGTCAATTTT TTGCTCTTGA GCAGGCGCTC 1440

CAGCAGGCTC GTAAATTGAT AGCAGAAGGA GCCAGTATGC TAGATATCGG CGGAGAATCG 1500

ACTCGGCCGG GAAGTAGCTA TGTTGAGATA GAAGAGGAAA TCCAGCGTGT TGTTCCAGTG 1560

ATCAAAGCGA TTCGCAAGGA AAGTGATGTC CTCATCTCTA TTGATACTTG GAAGAGTCAA 1620

GTAGCAGAGG CTGCTTTGGC TGCTGGTGCC GATCTAGTCA ATGATATCAC TGGTCTTATG 1680

GGTGATGAGA AAATGGCTTA TGTGGTAGCT GAAGCGAGAg CGAAAGTGGT CATCATGTTT 1740

AACCCAGTTA TGGCTCGACC TCAGCATCCT AGTTCGCTTA TCTTCCCTCA TTTTGGTTTT 1800

GGTCAAACCT TTACAGAAAA AGAGTTAGCT GACTTTGAAA CATTGCCAAT CGAAGACTTG 1860

ATGGTGGCTT TCTTTGAACG AGCACTAGCG AGAGCGGCAG AAGCTGGTAT TGCACCAGAA 1920

AATATCCTGT TGGATCCAGG AATTGGCTTT GGTCTGACCA AGAAAGAAAA TCTGCTTCTT 1980 

TTACGGGACC TGGATAAACT ACATCAGAAG GGCTATCCAA TCTTTCTCGG AGTGTCGCGC 2040 

AAGCgATTTG TCATCAATAT CCTAGAGGAG AATGGTTTTG AAGTCAATCC TGAGACAGAG 2100 

CTTGGTTTCC GAAATCGGGA CACGGCTTCG GCTCATGTAA CTAGTATCGC TGCGAGACAG 2160 

GGTGTAGAAG TGGTGCGCGT GCATGACGTA GCTAGTCACA GGATGGCAGT TGAAATTGCC 2220 

TCTGCCATTC GTCTGGCTGA TGAAGCGGAA AATTTAGATT TAAAACAATA TAAATAAGAT 2280 

GAAAGAAATT GAAAACAATC AGTGGATTGC TAACTACCGG ACGGATCAAC CGCATTTTGG 2340 

CTTGGAACGA ATGGTGGAAC TGTTAGCTTT GCGTGGCAAT CCCCATCTCA AACTCAAGGT 2400 

CCTCCATATC GGAGGGACTA ACGGCAAGGG CTCGACTATT GCTTTTTTGA AAAAGATGCT 2460 

AGAAAAGCTA GGGTTGAGAG TTGGCGTGTT TAGCTCGCCC TATCTCATTC ATTACACAGA 2520 

CCAGATTAGC ATCAATGGGG AATCGATCTC AGAAGCGAGG CTAGAAGCTC TCATGGCAGA 2580 

CTATCAGTCT TTGCTGGAGG GAGAAGCGGT CGCCAATTTA CAGGGCACAA CCGAGTTTGA 2640 

GATTATCACA GCCCTGGCCT ATGACTACTT TGCCTCAGAG CAAGTAGATG TGGCCATCAT 2700 

GGAAGTTGGC ATGGGTGGAC TTTTGGATAG TACCAATGTC TGTCAGCCCA TTTTGACAGG 2760 

AATTACAACT ATTGGCTTGG ATCATGTGGC TCTACTTGGT GACACCTTGG AGGTCATAGC 2820 

AGAGCAGAAG GCAGGTATTA TCAAACAAGG GATGCCCTTG GTAACAGGGC GTATTGCTCC 2880 

AGAAGCCTTG GCTGTGATTG ACCGCATTGC GGAAGGGAAA GATGCGCCGA GACTTGCCTA 2940 

CGGGACAGAT TATCAGGTTC GTCATCAAGA AAGTGTGGTG ACAGGGGAAG TCTTTGACTA 3000 

TACAAGTGCT GTCAGACAAG GTCGCTTCCA GACTAGCCTG CTTGGTTTGT ACCAAATAGA 3060 

GAATGCTGGG ATGGCCATAG CTTTACTTGA TACTTTTTGT CAAGAAGATG GTCGAGAGCT 3120 

AGCAAGCAAT GATTTTCTTG GTCAAGCCTT GGAAGAAACA AGTTGGCCAG GGCGTTTGGA 3180 

AATCGTGTCA AGAGATCCCT TGATGATTTT GGATGGAGCC CACAATCCCC ATGCTATCAA 3240 

GGCCTTGTTG GTAACCTTGC AAGAACGTTT TGCGGATTAT CATAAGGAAA TCCTCTTCAC 3300 

TTGTATCAAA ACCAAGGCCT TGGAGGATAT GTTGGACTTG CTGGGAGCCA TGCCAGTTAC 3360 

CGAGCTTACT CTAACACATT TTGCGGATAG TCGGGCGACG GATGAAAACG TGCTGAAAGA 3420 

GGCAGCTAAG TCTAGAAATC TCAGCTACCA AGATTGGCAT GATTTTCTAG AGCAGAATTT 3480 

GACAGATAAA AAAGAAGAGA AACAAACAGT TAGGATTGTC ACAGGTTCCT TGTATTTCTT 3540

GAGCCAAGTG AGGGCCTATC TGATGGAGAG GAAGAACGAG AATGGATACA CAAAAGATTG 3600

AAGCGGCTGT AAAAATGATT ATCGAGGCTG TAGGAGAGGA CGCTAATCGC GAGGGCTTGC 3660

AGGAAACACC TGCTCGTGTA GCCCGTATGT ATCAAGAGAT TTTTTCAGGT CTTGGTCAAA 3720

CAGCAGAGGA ACATTTGTCA AAATCCTTTG AAATTATTGA CGATAATATG GTGGTAGAAA 3780 

AGGATATCTT TTTCCATACC ATGTGTGAAC ACCACTTCTT GCCATTTTAT GGTAGAGCGC 3840 

ACATTGCCTA CATTCCAGAT GGTCGTGTGG CAGGCTTGTC TAAGCTAGCC CGTACGGTTG 3900 

AAGTTTATTC GAAAAAACCA CAAATTCAAG AACGTTTGAA TATCGAAGTG GCCGATGCCT 3960 

TGATGGACTA TCTAGGTGCT AAAGGAGCCT TTGTTGTCAT TGAGGCGGAA CATATGTGTA 4020 

TGAGTATGCG TGGTGTTAGA AAACCAGGCA CTGCAACCTT GACGACAGTA GCTCGTGGTC 4080 

TATTTGAAAC AGATAAGGAT CTCCGTGACC AAGCTTATCG TTTAATGGGG CTATAAAAAG 4140 

AATCCGCTTC AAGCGGATTT TTCTAGAAAG GAATCATTAT GGATCAACTG CAGATTAAGG 4200 

ATTTGGAAAT GTTTGCCTAT CATGGTCTTT TTCCTAGTGA GAAAGAATTG GGGCAGAAAT 4260 

TTGTCGTTTC AGCCATCCTA TCCTATGATA TGACCAAGGC AGCTACAGAC TTGGATTTAA 4320 

CAGCCTCTGT CCATTACGGA GAATTGTGTC AGCAGTGGAC GACTTGGTTT CAGGAAACGA 4380 

GTGAAGATTT GATTGAAACG GTAGCCTATA AACTGGTGGA ACGTACCTTT GAGTTTTATC 4440 

CTCTTGTCCA AGAAATGAAG TTGGAACTGA AAAAACCTTG GGCACCGGTG CATTTGTCAC 4500 

TAGATACTTG CTCGGTAACC ATTCATCGCC GCAAGCAACG AGCCTTTATC GCCCTAGGAA 4560 

GCAATATGGG AGATAAACAA GCAAACTTGA AGCAAGCCAT TGACAAACTG CGAGCTCGTG 4620 

GCATCCATAT TCTCAAAGAG TCCAGTGTCT TAGCGACGGA GCCTTGGGGT GGAGTGGAGC 4680 

AGGATAGCTT TGCCAATCAA GTGGTTGAGG TGGAAACCTG GCTACCAGCA CAAGACTTGT 4740 

TAGAAACCTT GTTAGCCATT GAGTCAGAGC TGGGACGGGT GAGAGAAGTG CATTGGGGAC 4800 

CTCGTTTGAT TGATTTGGAC TTGCTCTTTG TGGAGGACCA GATCCTTTAT ACAGACGACC 4860 

TCATATTGCC TCATCCTTAC ATAGCGGAAC GCCTTTTTGT CCTTGAGTCt TACAGGAAAT 4920 

TGCGCCTCAT TTTATCCATC CGATATTAAA ACAACCGATC CGCAACTTGT ATGATGCTTT 4980 

GAAAAAATAG AAAAACTCTA GTTTTCAGTT ACTTGCAACT GAAGGCTAGA GTTTTTATAC 5040 

TCTTCGAAAA TCTCTTCAAA CCACGTCAGC GTCGCCTTAC CGTACTCAAG TACAGCTTGC 5100 

GGCTAGCTTC CTAGTTTGCT CTTTGATTTT CATTGAGTAT TAAAATAGGT CATTTTCTTC 5160 

TGGGAGGAGG ATAGTTTCTC TACCGTCCAT GTCTAAAACC AGTACTCTTG GGGGATAACG 5220 

AGGGTCGAAA GGATGGTTAA AGTCAAAATC AATGGCTGTA GGGAGGTGTT GACTTGAAAA 5280 

GTGGAAGGTA ATCTTTCCTT GGTTATTAAG CAATTGAAAC TCGAGTTCTT CTTCCAATTC 5340 

AAAGACATTT TTTAAGAAAT GGTCGATGAT ATACCAAAAA GAGTCAATGA TGTCATCAGG 5400 
CAAGCTGGTA ACAATACCAA AACTAGCAGA TCGCATGTGG GTATTGGTAA AAGCCATATC 5460

TCTGTCCCCT TTCTTTTCCC TTATCATACA GCAAATAGGA TTAAAAATCA AGAAAAGGTG 5520 

ATTTTTTGAA AAGGATTTTA GTTACAGGGA GAAATAGGGA AAAAATTCCT AAAAATCTAC 5580 

CGAAGTTAAT AGGTAAATTC CCAAATTAAC TTGATTATAT AACTTTCAGT TACTTTGAGA 5640 

AGTTACCGAA AAATATTTTT CATATCTATT GACTTTTAGG GGTAAAATTT GGTATGATAG 5700 

TAGGCGGTAT TGTTTACCCC ATTTGAAAGG CCCCGGAACC TTCCAAATAC TTTTCGATGG 5760 

GAAGGAACAC CCATCACCGT AAACAAAAAT CGAACTATAT ATAGGAGAAA TCATGAACAA 5820 

AACAACATTT ATGGCTAAAC CAGGCCAAGT TGAACGTAAA TGGTACGTAG TTGACGCAAC 5880 

TGATGTACCA CTTGGACGTC TTTCTGCAGT AGTTGCTAGC GT 5922 



(2) INFORMATION FOR SEQ ID NO: 268: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1988 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 268:



TAACTATCTA CGATGAGCTG TTGTGATTCT CATTAGTTCC CCTTTCCCAA GAGGCATAGG 60

GGTGCGCATA ATAGATGTGC TCCTCAGAAA ATATATCAAA CAAGCGATTG AATTCCGTTC 120

CATTATCTGC CGTGATGGAA AGAATCTTGT GTTGTTTTAA GATGAGTTTT AGAGCCTGAT 180

TGACCACCTC AGCACTTTTA TTTGGAATCA ATCGGATGAT CTGATGTCTA CTCTTTCGAT 240

CCGTCAAGAC AATCAAGCAG TAGTTTTTCG ATCTCGTAAG TAGAACCGTA TCAATCTCAT 300

AATGCCCATT CTCCAAGCGA AGATTGATAG CTTCAGGCCG CTGTTCGATG GATTGACCAG 360

CAGGTTTAAA GTTGGTGCTA GCCTGTTTCT TAAGCGCTTT TCCTTTTCTA GGGTAAAGCA 420

AATCCTGCTT GCTTAACCCC AATTTTCCAT GATGAATCCA ATAGTAAATG GTTGAAATTC 480

CCACGTTAAC CCCTTTAGCC ATAACCATCA TTTCAGGCGA AAATTTTTGG TTATGATAGT 540

GGAGAATCTT TTCCTTTAGT TCCTTGGTCA AGCTTGATTT CTTGACCGAG CGCTTGCGAT 600

TGTTTTCATA AGACTGTTGA GCGTAGTCGG CAGAATAAAC CTCTTTGAAG CGCCCTTTTC 660

CAAGACATTG TCGGACTGTC CCACGCTTGA TTTCAGTGTG ATAGTTTGAG GAGCTTTTCC 720

AAGTAGAGAG GCAATTTCTC TATTTGATTT TCCTTCTTTT TTCCATCTTT CGATTAAGCG 780

ACGGCTATCG ATTGTCAAAT GTTTGGCTTT TGTAGTATAA TTGTCTTGCA TCTCTGTGCC 840

TTTCTTGTGT TTGTGGTTGA ACAACAAGTA TAACACAGAG GTGCTTTCTT ATGCCTACAA 900

GAGCTTTCAT TATTTCCATT TTCTTTTGGA TTTCACTCTA TTCTGAAAAA CTTGTGTATA 960

TTTACTGAAG CTAGCAAGTC TTACCTGTAA ATTTAATGAA AGCAACACAA AATCCGAGAG 1020 

GGGAATCTCG GATTAATAGA TAGAGAGTTT TTAGTTTAAA TAAATTGTTT AAAATATCAA 1080 

CAACATCACT TCTTTTCTTA ACCTGATAAG TCTTGATTCC TAATTTTGGG GCTACGATTA 1140 

TATTGTCCTC AATATCGTCT AGAAAGACAC AATTTCTAGG TTATAACTGG TATTTATCGA 1200 

TAGTTACTCA TATACATCAG TCCACCTCCA TACTTATGTG CGAGCCTCTC TTTGTATTAT 1260

ACCTCCATAC TCACCTTACA GATTCTTTTG GTAATAATAT CTTTGCCTAA TGTAGAGACA 1320 

GTCTTGCAAA GAAAAAACTT CCTTGTAGCC ATGTTTCTGA TAAAAGTCCG GTGCCTGGAA 1380 

CTGGTAAGTA TTGACAAAGG CAAAACAACA ATTTCGATTC TTAGCTTCAC TTTCTGCCTG 1440 

TTGCAATAGT TTTGAACCGA TTCCTTGCCC TCGCAGTTCC TCTTTTACAA ACAAATACTC 1500 

GATTTCTAGC CAATTTCCAA AAGTCTCTGC TATCAAACCT GCCAGGAGAT TGCCCTTTTC 1560 

ATCTTCGACA TAAAGATTAA GTGGCTCACT TTCAGCCTCT TCTCTTTTTG AACGGTTATA 1620 

AACACGAATC AGATTCCCTA TTTCTTGCGA TTTATGTGAT TCCTTATTTT CCAATCTAAA 1680 

GTATAGTGAA ATGAAATAAA ACATGCGCAA ATCGATTAAG GAATTTAATC TAATTTCTAA 1740 

CAATGTCTTA GAAATCAAAG TGTACTATTT TAACTTCAAT GCACTATACA TCTAATACTC 1800 

AATAAAAATC AAAGAGCAAA CTAGGAAACT AGCCGCAGGT TGCTCAAAAC ACTGTTTTGA 1860 

GGTTGTAGAT AGAAcTGACG AAGTCAGCTC AAAACATAGT TTTGAGGTTG TAGATGAAAC 1920 

TGACGAAGTC GGCTCAAAAC ATGGTTTTGA GGTTGTAGAT GAAACTGACG AAGTCAGCTC 1980 

AAAACAGG 1988 



(2) INFORMATION FOR SEQ ID NO: 269: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 709 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 269: 

CCGGATATTT GTTTTATGTA ATTTTCTTGC AAGTTTCTTC TTAGTAGCTT GTCAGTCAGG 60 

TTCTAATGGT TCTCAGTCTG CTGTGGATGC TATCAAACAA AAAGGGAAAT TAGTTGTGGC 120 

AACCAGTCCT GACTATGCAC CCTTTGAATT TCAATCATTG GTTGATGGAA AGAACCAGGT 180 

AGTCGGTGCA GACATCGACA TGGCTCAGGC TATCGCTGAT GAACTTGGGG TTAAGTTGGA 240 
AATCTCAAGC ATGAGTTTTG ACAATGTTTT GACCAGTCTT CAAACTGGTA AGGCTGACCT 300

AGCAGTTGCA GGAATTAGTG CTACTGACGA GAGAAAAGAA GTCTTTGATT TTTCAATCCC 360 

ATACTATGAA AACAAGATTA GTTTCTTGGT TCGTAAGGCT GATGTGGAAA AATACAAGGA 420 

TTTAACTAGC CTAGAAAGTG CTAATATTGC AGCCCAAAAA GGGACTGTTC CAGAATCAAT 480

GGTCAAGGAA CAATTGCCAA AAGTTCAATT AACTTCCCTA ACTAATATGG GTGAAGCAGT 540 

CAATGAATTG CAGGCTGGAA AAATAGATGC TGTTCATATG GATGAGCCTG TTGCACTTAG 600 

TTATGCTGCT AAAAACGCTG GCTTAGCTGT CGCAACTGTC AGCTTGAAGA TGAAGGACGG 660 

CGACGCCAAT GCCGyTGCTC TTAGAAaATA GTGATGATTT GAAAGAAGT 709 



(2) INFORMATION FOR SEQ ID NO: 270: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1680 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 270: 



TATAAAATGT TAAGTTAAAT GATTTCAAAA TTCAGAAAGG GATTGCTTTA TGCAGTTCCT 60

TTTTATTTTA ACAGGAGTGA AACTATAGTG TTTCTAAATT GTGAATCAAT CAAAACTGAT 120

TGTGATGGGG CTATTCTAGC TTTAGAAACC TTCAAAAATT AAAATTTAAG GCAATCAATT 180

ACTTGGAAGA GTATGAAAGC ATTTAGTTTA TAGGAATTCT AGGTCTAGAA TTACATATAT 240

ATATTTATGA AGACGGGGTG TTCGATAGTT AGTATTGTTC TATTCTGAAA GATTTGAGCT 300

GTCAGTTGTA TAGAAAGTGT TCGAATTTTT TTAAGTGATT AAATTAGTTA ATTGTATGAG 360

GTGCTTTATG ATATAATGTT CTTAATGAAT TTTCAGAAAG GAAAACCTCA AATTGTTCTA 420

CAAATTTCTA CTCTTCGACC TCGACCACAC TCTTCTTGAT TTTGATGCTG CTGAGGATGT 480

GGCTTTGACC CAACTTCTAA AAGAAGAAGG AGTTGCGGAT ATTCAGGCTT ATAAAGATTA 540

TTACGTTCCT ATGAACAAGG CTCTCTGGAA AGACTTGGAG CTGAAGAAAA TCAGTAAACA 600

AGAGCTGGTT AACACGCGCT TTTCTCGTTT ATTTGCTCAT TTTGGACAGG AAAAAGACGG 660

TAGTTTTCTT GCCCAGCGTT ACCAATTTTA CCTCGCCCAG CAGGGACAAA CACTATCGGG 720

CGCTCATGAT CTCTTGGACA GCCTCATTGA GCGTGATTAT AACTTGTATG CTGCGACAAA 780

TGGCATTACT GCCATTCAGA CAGGACGTTT GGCTCAATCT GGTCTAGCAC CTTATTTCAA 840

TCAAGTCTTT ATCTCAGAAC AGTTGCAAAC TCAAAAGCCG GATGCTCTTT TTTATGAAAA 900

GATTGGCCAG CAAATTGCTG GATTTAGTAA AGAAAAGACG CTGATGATTG GAGATTCTCT 960

AACCGCCGAC ATTCAAGGTG GCAATAATGC GGGGATTGAC ACTATCTGGT ATAATCCTCA 1020

TCACCTCGAA AATCACACAC AAGCCCAGCC GACTTACGAA GTCTATTCTT ACCAAGACTT 1080 

GCTGGATTGT TTAGATAAAA ATATTCTTGA AAAGATCACA TTTTAAAGGA GACGAGCTAA 1140 

TGACTACAAA AAAGCTAATA TTACTATTGA AGAGTACATT GAAATGTCTG AAGTTGATTT 1200 

TAATGAAGCT GTTAATTATG AATTTACATC TGACACTTGT CAATTAGCAA ATAGTATTTA 1260 

TCAATCTCTT TTTAAGTTTT TTGATAAGAA AAATTTCTCT GGCGATTTAA TTTTTACTTG 1320 

GAAATCTCCA TCATTAGTCA AAGAAGGGGA TTATATTGGG AGAAGGGATT CACAAGTAGA 1380 

TAATCTTAGA GTAATAGGAA ATATATTTCC GAATTATCTT ACTAATCGAA AATATAGCCT 1440 

CAATATGAAT CGTAATGGCT GTATGGGAGA TTTTCCTCAT GACTTTTTTG ATATATACCT 1500 

AGATCATGTA GCAAAATATG CCTACGAACA AAAAGTTAAT AATATTAAAG AGTATTATCC 1560 

TTTAAAAAGA GCGATTTTAC ACCAAGAGAA TGCATTGTAT TTTCGATTTT TTTCTAATTT 1620 

TGACGACTTT TTAGAAAAAA ATTATTTAAA GACTATATGG CAAGTTTCTA AAGAAACTCC 1680 
(2) INFORMATION FOR SEQ ID NO: 271: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 598 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 271:

AGCTCGGTAC GTAGTATnTG TGGTGCATAA ATGAGTGAAA AGAGGATAGA GAGGATGAGG 60 

CCGATAAGAA CACCGGTAGC TGCATCGTGA AATACTTGTT TTTTCATAGT TCTAATTTCT 120 

CCTTGATGGT TTTTAGATAA CGGCGTGAAG AGTAGGTGAA GCTTTCGTTT TTCAAGAAAA 180 

TTTCTACCAG ACCGTTTGGC GTGAgCTTGA GGTGAGAGAT GGAATCGATA TTGATGATTT 240 

CTGATTGGGA AATTTGGATA AAATTGGTTG GCAAGAGTTT AAGAACCTGA TAGAGTCGCA 300 

AATCAATGCT GTAGGTCTGA CTCGCGGTTT CTGCTAGAAC CTTCCGATTC TCGATATAGA 360 

AGCGCTGAAT CTTGCCAATC TCAACTAGAT AGACCTGATC ATCGATTTTT CCTTTGATTT 420 

TTTCTCTTTG GTCCAGATTT TCTGCGAACT CGATGACTTT CTGGACTTTT TCGGTTTCTT 480 

GAGGTGCTTG GACAATCAGC TTTTCCTCCT CGTAAGTCTC ACTAATCTGT AGTTCTACTT 540 

TCATAGTTTT CTCTCCTTTT CAGTTATACA AGGTTGTGAT CACTTCCTGT ATATCCGG 598 
(2) INFORMATION FOR SEQ ID NO: 272: 
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1099 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 272: 

CCAGCAAATC AATAACTGCA ATTGCTATAA AATGGATTCT ATAGAGTTTT TTCATGACAA 60 

GACCTCCCTC TTTTATCTAA CTTCATTCTA CTCCAAAAGA ATGGGAGTTA CAACTAAAAT 120 

GATAAAAATA GCAGAAGGGA GATTCTCTTA AGTTGGCTAG TATTCTTTAT TTGAGTTTCC 180 

TTCTATTATC TAACTTCTTC ATCATTCCAG ACAAATAAAG CTCCGATTGC ATTGAGGATA 240 

TAAAAGATGT ATTTACCGAT ATTGGCGAAG TTTCCTTGAA TACCAGCTTT TGTCAGCTGA 300 

ACGAAATTGT AAATCAACCA AAAGCCCCAC TGAGTTGTTA GTTTTAATGC ATTCAAAGCA 360 

TTGGCAATGA GGGACAGTGC AAAGGCAATA GTTGTTACGT AGGCAAGGAG ATTCATCTTG 420 

CCCCCATATC CGATATAGTT GGTCACAAAG GCAAAGAGGA AGGCGATGAT GGAAATGATG 480 

ATGGCCGCCA ATTTTACCTG TTTTTGGCTC ATTTGGTTGG GTCTGCCTTC TTGCGAAGCT 540 

TCCCACTTCT TTATAGCAAA GGTATAAATG AGGAAGGTGA CGGGATAGGT AATGATGGCC 600 

GCCTTATTTC CAAGGATATA ATCAATAGCA CCGGACAAAA TGGTATTAAC AATACCAAAG 660 

TAATTTCCCC ATTTGCTTAA TTTCCCCGTG AAACGAGTGG ACAACATGGA AATCCCAACG 720 

TTGGTTACGG AAATCAATCC AAAGGGTACA AGAGCTGTCC ATGATCCCCA GTCTACAAAT 780 

TTATCGAGGT GTGAGTTGAG GTAACCAGAT GCAATCGCAA TCCCAACGAC CAAAGCAACC 840 

CCGAAGAGGT CAAACTATTT AGATGTAGCA AAAATTTTTA GTGATTTTTT CATAGGTTAA 900 

ACTACCTTTC TTTTTTTCAA ATATTCTCCC ACCAAATGAA AGTAAAATAA AATGATAGAA 960 

ATAAAACCCT GAAAATAAAG GTTCTATAAT ATTTGTAGTG GGTAAATCCA CTATAGATAT 1020 

TATGGAGCCT ATTTTATTGT AGAAAAAAAG TCCCATATGA CCTATAATGA AAAGCGACAA 1080 

AACAACTCAT TAGAAAGAT 1099 



(2) INFORMATION FOR SEQ ID NO: 273: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2723 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 273: 
CTGGGATTCA CGTGAAAAGG AAGCCCAGAG AGTAGCCAGG TGTACTGCTA GAACAGTGAG 60

TGAAATTGAA TATTACCATA GAGAGTCAAC CCAGATAGCT CAGGCTTTAG TTGAAAATCA 120

AGCTCGTATC GAGGGAATCT ATAAATACTT TAGCCTTAGC ATGCCAGACT ATTTTTACTG 180

GCAATTAGAG CGGAAAGCTT CGCCTTATAT ATCAGTCTCT CTGTATGAAA ATGTTGATGA 240

CCTCTATGTT CGAAATGATT TTGTAACTGG GGTGGCCATT GCTTTTCAAG ATTACAAGGA 300

AGTCTATGTT TCTACTAAAG ACAAACGTAG GkkAGAAAAA ATCAGGGCTG AGGATTTCAA 360

ACCAGCAGGA AATAGTTTTG CCATTCCAGT GTCAGATCCA GTGTCAGATC AAGACTTAGG 420

AGTGATTTAC ATCTCCTTGG ATCCTGCTGT TTTATACCAT GCCATTGATA ATACTAGAGG 480

TCATACTCCG ATGGCAGTAA CAGTGACCTC ACCTTTTGAT ACGGAGATTT TTCATATGGG 540

TGAGACAGTT GATAAGGAGA GTGAAAATTG GCTAGTTGGC TTAACTTCTC ATGGATATCA 600

GGTTCAGGTG GCAGTTCCTA AAAACTTTGT TTTACAAGGA ACAGTGACTA GCTCTGCTTT 660

GATTGTGGGT TTGAGCCTTC TCTTTATTGT CATTCTTTAT CTGACTTTGA GGCAGACTTT 720

TGCTAATTAC CAAAAGCAGG TAGTGGATTT AGTAGAATCC ATTCAAGTCA TTGCTCAAGG 780

CGAAGAGGGG CGTCGGATTG ACATTTCCGA GAAAGATCAG GAATTACTCC TAATCGCGGA 840

GACGACCAAT GATATGTTGG ATCGATTGGA AAAGAATATC CATGATATTT ACCAGTTAGA 900


GCTTAGTCAA AAAGATGCCA ATATGCGAGC CTTGCAGGCG CAAATCAATC CTCATTTTAT 960

GTATAATACG CTGGAGTTCT TGCGCATGTA TGCAGTTATG CAGAGTCAAG ATGAGTTGGC 1020

AGATATCATT TATGAATTCA GTAGTCTCTT GCGTAACAAT ATTTCCGACG AAAGAGAGAC 1080

CCTCCTCAAA CAGGAATTAG AATTTTGCCG TAAATACAGC TATCTCTGCA TGGTTCGCTA 1140

TCCCAAGTCC ATTGCCTATG GTTTCAAGAT AGATCCAGAG TTAGAGAATA TGAAGATTCC 1200

CAAGTTTACC TTGCAACCGC TGGTAGAAAA CTATTTCGCG CATGGTGTTG ACCACAGGCG 1260

GACAGATAAT GTGATTAGCA TCAAGGCTCT TAAACAGGAT GGTTTTGTGG AAATTTTGGT 1320

GGTCGATAAT GGTAGAGGAA TGTCGGCTGA AAAGTTGGCA AATATCCGAG AAAAATTAAG 1380

TCAGAGATAT TTTGAACACC AAGCCAGCTA CAGTGATCAA AGGCAGTCTA TCGGGATTGT 1440

CAATGTACAC GAGCGTTTTG TGCTCTATTT TGGAGACCGC TATGCCATTA CTATAGAGTC 1500

TGCAGAGCAA GCCGGTGTTC AGTATCGTAT TACAATTCAA GATGAGTAGA AAGGGAGAAA 1560

ATGTATAAAG TATTATTAGT AGATGATGAG TACATGGTGA CAGAAGGTCT GAAGCGTTTG 1620

ATTCCCTTTG ATAAGTGGGA TATGGAGGTC GTCGCAACAG CCAGTCATGC CGATGAAGCT 1680

CTAGAATATG TTCAGGAAAA TCCTGTCGAT GTCATCATTT CCGATGTCAA TATGCCAGAC 1740

AAAACAGGGC TTGATATGAT TCGGGAGATG AAAGAGATCT TACCAGATGC TGCCTATATC 1800

CTGCTCTCAG GTTATCAGGA GTTTGATTAT GTAAAAAGAG CAATGAACCT TAGTGTGGTG 1860 

GACTATTTGG TCAAGCCTGT TGATAAGGTA GAGCTGGGAA ATCTGCTGGA GAAGATTGCA 1920 

GGTCAGCTCG GCGAGAGAGG GAAGAAAAGT CAGACTCTTA GTCAAGAATT AGACGAGGCT 1980 

GGATTTGTTA GTTATTTAGG GGATAAGGAG AATTGGTGGA TAGGTCTATC CAAGGAAAAA 2040 

CAAGGTTCCT TCACCATTCC CTACTATGTC TTGGGTCAAG ACTGGCAGAT TTTCATTTCT 2100 

GGCCACCCCC TAGATGGTTT AGTCGTTACA CCTTTTGAAG CTCCTTATCA AGAACACTTT 2160 

GAACGCTGGA AGCTGAATGC TGAGAAAACC CTCTTTTACG GTTCTGTAAA TCTGCAGCAG 2220 

TCTGAGAGTC TCTTTGCCTA TTACGAACCG ATTTATAGGG TTATCATTCA GGGAAATCTC 2280 

AATCAAATCG TAGAAGAGTT AAATCTCTTG GAGAAGGTAG TTCTTGAAAA TACACCTCGT 2340 

GTTTCGATTA CTAAACAGCT TTTTATCCAG TTTGTCATGG ATGTTTTCCA TTTATTTGAA 2400 

CATCTCAAAG CTGATGATAT GACGGACATT GTCAAAACCA TTCATGCTAT TCAATCCTTC 2460 

GATGAATTGG TTTCTTATAT CAAGGAAACT CTGATCAGCT TTTTCGGTCA ATACCGTATG 2520 

AATGAAAATG TGGTCAGTGT GCTGGAAGTC ATTGGTCGTG ATTACCAAAA AGAGCTTTCC 2580 

CTCAAGGATA TCAGTAAGGC CCTCTTTATC AATCCTGTCT ATCTAGGGCA GTTGATTAAG 2640 

CGTGAAACCG ATTCGACCTT TGCAGAGTTA CTAAACAAAC AACGTATTAA GGCTGCCCAG 2700 

CAGCTCTTGC TTTCAACTAG TGA 2723 



(2) INFORMATION FOR SEQ ID NO: 274: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 836 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 274: 

CCGCAGTTTT TTTAAACCGT ATATAAGTAT AGCATAGTCA AAAAAAGAAT GCAAGATTTT 60 

TGCAAACTTT TTTAAAATTT TTCGTAATTT TTCTTTTAAA GTTCTACTGT CAGGACTTGA 120 

CCTTGCTTAA CAACCTGTTC TCCGGCGATA TAAACATCAT CTACATCACT AGATTTAACT 180 

GCATAAACCA GGTGAGACAG CATATTTTCC TGAGGTTGGA GATGAATTTT CCCTTGTGGT 240 

TGAATGACCA GAAAATCTGC TTGCTTGCCG ACTTCCAGAC TTCCTATCTG ATTTTCCATT 300 

CCAAGGACCT TAGCCCCTTC GATTGTCAGT ACCTTGAGAG CTGTTTCGAT TGGAAACTGG 360 

CTGGCATCCC CACTTTTCAT CTTCTGAAGA AGAGCTGCAG TCCTTCCTTC CTCAAACATA 420

TCTAGATTGT TATTGGAAGC AACCGAGTCA GTCGCAATTC CGACTGCTAC TCCCGCTTTT 480

TGGAGCTGGA TAATTGGAGC AATTCCTGAT GCCAGTTTGA GGTTACTGAT AGGATTGTGG 540 

GCGATAGCnA CTTGAGAAGA TGCCAAGCGT TCAATTTCTC TCTCGTTTAA TTCGACCCCG 600 

TGAGCAAATA CGGACGGATG ATCTAAATAA CCCAGTTCTT CAAGAAAAGC AAGGGGGCGT 660 

TTGCCGTATC GTTTGAGGAT AATTCCTGAC TCCTCCTTGG TCTCCGCCAC ATGGACATGG 720 

AGCGGAATAT TTAGCTCTTT TGCCATTTCC AAACTCGCTT CCAGCAAGTC TCTACTGCAG 780 

CTATACGGAG AATGAGGTGC TACCATAACC TTGAAATTTG GATTTTTATA TTTTAA 836 



(2) INFORMATION FOR SEQ ID NO: 275: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2335 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 275: 

ATTTTATTTC ACTTTTTAGG TGGTCTGGGG CTATTCTTAT ATAGCnTCAA GACCATGGGA 60 

GACGGTTTAC AACAAGCTGC TGGAGATCGC CTGGGTTTTT ACATTGACAA ATATACTAGT 120 

AATCCTTTGT TTGGAGTTCT GGTTGGTATT GGGATGACTG CTCTAATTCA GTCTAGTTCT 180 

GGTGTAACAG TTATCACAGT CGGCCTGGTC AGTGCCGGTC TCTTAACCTT ACGTCAGGCT 240 

ATCGGGATTG TCATGGGTGC TAATATTGGG ACAACTGTCA CATCCTTTCT CATCGGTTTT 300 

AAATTAGGTA ACTATGCCCT ACCTATGCTC TTTATCGGTG CCGTCTGTCT TTTTTTTACG 360 

AAAAATCGGA CAGTCAATAA TATCGGACGC ATCCTCTTTG GTGTCGGTGG TATCTTTTTT 420 

GCCCTCAATC TCATGAGCGG CGCAATGGCT CCACTCAAGG ATTTACAGGT CTTTAAGGAC 480 

TATATGATTG AGCTAAGTAA GAATCCTGTT TTGGGTGTCT TTGTCGGTAC TGGCTTGACC 540 

TTGCTAATTC AAGCTTCTTC GGCTACCATT GGGATTTTAC AAAACCTCTA CGCCGGCAAT 600 

CTAATTGATC TACAGGGAGC TTTGCCAGTT CTATTTGGTG ACAATATCGG GACAACCATT 660 

ACAGCCATCA TTGCCTCTTT AGGGGCTAAT ATTGCAGCTA AACGGGTAGC AGGAGCTCAT 720 

GTTGCCTTCA ACGTTATCGG AACAGTTGTC TGCGTTATTT TTCTAGTTCC TTTTACTGTC 780 

CTGATTCATT GGTTTGAAGC TACGCTAAAT CTAGCACCGG AAATGACCAT CGCCTTTGCT 840 

CACGGAACCT TTAATATTAC CAACACCATT GTCCAATTTC CATTTATCGG AGCTCTGGCT 900 

TACTTTGTAA CCAAGATTAT TCCTGGAGAG GACGAGGTTG TCAAATACGA ACCCTTATAT 960 



WO 98/18931 



PCT/US97/19588



1306 

CTTGATGAAC ATTTCATCAA ACAGGCCCCA TCTATCGCTC TAGGAAATGC TAAGAAAGAG 1020 

CTCTTGCACT TAGGAAACTA CGCTGCTAAA GCCTTTGACC TTTCCTATAA GTACATCATT 1080 

GACTTGGATG AAAAAGTTGC TGAAAAAGGG CATAAAACCG AAGAAGCAAT TAACACCATC 1140 

GATGAGCAAT TAACACGTTA TCTCATTGCC CTTTCAAGCG AAGCTCTCAG CCAAAAAGAA 1200 

AGTGAAGTGC TTACCAATAT CCTTGATTCC TCCCGTGATT TGGAACGGAT TGGAGACCAC 1260 

ACGGAGGCTC TACTCAATCT GACTGACTAT CTTCAACGGA AAAATGTTGA ATTTTCTGAT 1320 

GCCGCCTTGA AAGAATTAGA GGAAGTTTAC CGCCAAACTA GTGACTTTAT CAAAGATGCT 1380 

CTGGATAGTG TGGAAAACAA TGATATTGAA AAAGCACGCA GTCTTGTAGA ACGTCATGAA 1440 

GCAATCAATA AGATAGAACG TGTTCTCAGA AAAACCCACA TCAAACGCCT CAACAAAGGC 1500 

GAATGTTCAA CACAAGCTGG GGTCAACTTT ATCGACATCA TCTCACACTA CACTCGTGTA 1560 

TCAGACCACG CTATGAACCT TGCTGAAAAG GTTTTTGCAG AACAAATCTA AGAACCAAGA 1620 

AGCTATCCAT CATAATTGGA TGGCTTTTTA CTTTTTCCTA AGCAAGACTA GGATGAATGA 1680 

AACTGAAAGA GTATTCTGCA GATATATAGT CCCCAATTAT TCACCCCAAA TCTAAAAACC 1740 

ATCCAGAATC CTTGCCTTAG CTTAGATCCT GGATGGTTTC TTTTTTCACC CAATGGGTGT 1800 

TTTTTACTAG ACAAAAAAGA GTTTCCCCTT TATGGTATAA GTGTAGAAAA AAACACAAAA 1860 

AGAAAGGAAA CTCACATGAA CAGTTTACCA AATCATCACT TCCAAAACAA GTCTTTTTAC 1920

CAACTATCTT TCGATGGAGG TCATTTAACC CAGTATGGTG GTCTTATCTT TTTTCAGGAA 1980 

CTTTTTTCCC AGTTGAAACT AAAAGAGCGG ATTTCTAAGT ATTTAGTAAC GAATGACCAA 2040 

CGCCGCTACT GTCGTTATTC GGATTCAGAT ATCCTTGTCC AGTTCCTCTT TCAACTGTTA 2100 

ACAGGTTATG GAACGGACTA TGCTTGTAAA GAATTGTCAG CTGATGCCTA CTTTCCAAAA 2160 

TTGTTGGAAG GAGGGCAGCT TGCTTCACAG CCAACCTTAT CCCGTTTTCT TTCCAGAACT 2220 

GACGAGGAAA CAGTCCATAG TTTGCGATGC CTCAACCTTG AATgGkCGAA TTCTTTTTAc 2280 

AGTTTCACCA GCTAAACCAA CTCATTGTAG ATATCGATTC TACCCATTTC ACAAC 2335 
(2) INFORMATION FOR SEQ ID NO: 276: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 752 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 276: 
CGGATTCACT GTTGTTGACT AATCAATAAC ACAGTAGAAA ATCTCACAGC AGTCTATTAG 60

TTGCTTTTCA TACTAGGCAA GTGACTGAGG CTTGTACTTG GGTACAGCAA GGGAGCTTAA 120

GGCCGTAGAA GAGAAAAATA GTAGACTGAA AACCCGCAAG ACTTCATCAT TTCGAGAAGT 180

GACGTGGGAG ATGAAAATCG ATTGAACCAC TTACAAGGAG AATAGAAAAT GGCTAAAAAA 240

AGCAAACAAC TTCGTGCTGC TCTTGAGAAA ATCGACAGCA CAAAAGCATA 300

GAAGCTGTAG CACTTGCAAA AGAAACTAAC TTTGCAAAAT TTGATGCAAC TGTAGAAGTT 360

GCTTACAACT TGAACATCGA CGTTAAAAAA GCTGACCAAC AAATCCGTGG AGCAATGGTA 420

TTGCCAAACG GTACTGGTAA AACTTCACGT GTTCTTGTTT TCGCACGTGG TGCAAAAGCT 480

GAAGAAGCAA AAGCTGCTGG TGCAGACTTT GTTGGTGAAG ATGACCTTGT TGCTAAAATC 540

AACGACGGTT GGTTGGACTT CGACGTAGtT ATCGCTACAC CTGATATGAT GGCTCTTGTT 600

GGACGTCTTG GACGTGTCCT TGGACCACGT AACTTGATGC CAAACCCTAA AACTGGTACT 660

GTAACAATGG ATGTTGGCAA AGCGGTTGAA GAGTCTAAAG GTGGTAAAAT CACTTACCGT 720

GCTGACCGTG CAGGTAACGT TCAAGCAATC AT 752


(2) INFORMATION FOR SEQ ID NO: 277:









(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2643 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 277:

GTCAACATTG ATTTCAAGGC TGTTTGCTTT CTATCTCCCC TTTTTCATAA TGTATAATAA 60

AATGAAATAA TAACAGGACG AATTGATCGG GACAGTCAAA TCGATTTCTA ACAATGTTTT 120

AGAAGTAGAG GTGTACTATT CTAGTTTCAA TCTACTATAT TTTCGTACAG GTGCTTCAAC 180

CATTTGAACG ATTTCAAATC CTTCTTTTTG GTAAAGATTC TGAGCTCTTT GATTTGCCTC 240

GAAGACATTT AGAGAAATAC TGTCTATATC TCTATTTTCA AATGCTAAAC TAACAAATTT 300

CCTTAAAGCC TTGCTACCTA AGCCTTGCTC CTGTTTCTGG GGGTTGATAA AAAATCTCCC 360

GATATGAAGA TTGCTGTCTT CTAGCCTGAT TTTCTGGATA AATCCCACAA ACTCTTGTTC 420

ATCAAAGATT GAAAAGACTC CTTCCAAGGC TTGAAGTGTC AGTAGAAAAG GAATCCTTGG 480

TCCCATCCAT TGTTCTTGAA AGGATTTGCC TAGGGAGTTG GACCACTGGC ATACAAATTG 540

AGCGTTTTCT GTGCTCACCT TTTCTTCAAA ACGAATTGTC ATCTTTTCCT CACCACCTTA 600

TCTATGTTTC TCCATTATAC TATTTCTCCC ATTTTTTACG AATAGATAAG TATGATTGAT 660

TTTTATTTTT TTCTCGTCGG GAGCATTCTA GCTTCCTTTC TTGGTTTGGT CATTGACCGT 720

TTTCCAGAGC AATCCATTAT CAGTTCAGCC AGTCACTGCG ATTCCTGTCA GACTCCCTTG 780

CGTCCCTTAG ATTTGATTCC GATTCTCTCA CAGGTCTTCA ATCGCTTTCG CTGTCGCTAC 840

TGCAAAGTTC GCTATCCTGT CTGGTATGCC CTCTTTGAAT TAAGCTTAGG ACTCCTCTTT 900

CTGCTTTACT CTTGGGGATG GCTCTCCTTG GGGCAAGTCG TCCTAATCAC CGCTGGTTTG 960

ACCTTGGGTA TCTACGACTT TCACCATCAG GAATATCCCT TACTGGTCTG GATGACTTTC 1020

CAGCTAATCC TAATAGCTTC CTCTGGCTGG AATCTGGTCA TGGTCTCCTT CCTCATACTT 1080

GGAATTTTGG CTCATTTTAT CGATATCCGC ATGGGTGCAG GGGATTTCCT CTTTTTAGCT 1140

TCTTGTGCTC TCGTCTTTAG CGTAACGGAG TTACTGATCT TGATTCAGTT CGCTTCTGCG 1200

ACGGGTATCC TGGCCTTTCT CCTGCAAAAG AAAAAGGAAA GACTTCCTTT CGTGCCTTTC 1260

CTCTTACTTG CTACTTGTTT GATTATTTTT GGTAAGCTAC TGCTTGTCTG ATAAAATCCA 1320

ATTTCTGCCA TATATCCTTC ATGAAATTAT TTCACAGTTA AATTATAAAT TATTTCTTTT 1380

GTACAAAGGG ATGATGTTAT CAAATCGATC TGTTCTTCTA TCTTCTTGAT ACTGATCAAA 1440

AAATTTCATT TCGACTGAAA ATATTTCGCT TATAAACTGT AAACGAATAC TTTGTTTAGA 1500

CATTATAGTC GCTAGACTGA CTAGATGATT ACTCAAAACG ACGTCCAGAA TACTCTTTAC 1560

TTTGCTTGGT TTTTTAACAA AAATTTGATC ATCCAAGGGT TCAATCATTT TGTAACCTTT 1620

TTGCGCAATT TGACGATAAA AGTAAGAATG TTGCTTTGGA GTCAATAATC CTAACTTAAA 1680

AGCTCGATAC TCTAAAGCCT GTATCGAAAC ATTCAAATCC GACTTCAATA AAATATAACT 1740

ATCAGGATTG CTGACACGCT TGCCAACCCT CTCTTCAAAT TTGACTAAAA ACTCTTCTTT 1800

TGGCAATAAA AAACATGATG CAAAATAATT TGCTTCTTGC TCCAAACGAT CGCCATCTTC 1860

ATTCATATCT TTATATTTAT GTAAAAGAAT ATGTCCTAGC TCATGAGCTA AGTCAAAATT 1920

TCGACGTACA GATGATTTAT TCGTTCCTAA CACAATATAA GGTCTTCCCA ATTTTGACCA 1980

TGCGCTATAA GCATCAGCTT GGCCATTAAT TAATCGTTCC ACGATATAGA TGCCTGAACG 2040

TTCTAATTTA TAAAGCAAAT CATGATTATC TTTTGAAATA CCTAATTTTT CCCTGGCATA 2100

AAGAGCCAAT TCCTCAATGG ATTCTCCCTT ATGATAAGAT TCACTCACTA CATTACTTAG 2160

GTCATGAATT ATAATATTAG GTATAATTAC AAAACTTTCA AAATAATCAA TCAAACTATC 2220

TACCTTATGT AAATACATAG TTTGAATATC TATTGTTTTC CGTGTTGCTA GGTCTGCATT 2280

TCTAAAGGCA ATTACAGAAG AATCAAATCG AATGCTCTCT TCTTCCTGTT CAAAATAAGT 2340

TAAATCAACA TGAAATTGGT TGGCCAAATG CATTTTGGTT GATAATTTAG GTTTCGTTTC 2400

GTTGGACTCA AACTGCCAAA TGGCTTGTTC CGTTAAATTA ATTCTCTGAG CTAATTCTGC 2460

TCTACTTAAA CCATTTAACA GCCGTAATTC TTTCAATACC CGACCATTAA ACATTTACAT 2520

ACTCCTTACT ACTTTTGACC TTCTTGTTTT TCTATTCTTG GAATAATTTC AAAATCTTCT 2580

GTTTCCGATA ATTCTGAAAA ATTAGGAATA TCTTGATATT TAGCTTCTTC GAAATGGTAC 2640



(2) INFORMATION FOR SEQ ID NO: 278: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 582 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 278: 

TGACCAGTGG CAAAATGGCT ATCCAAATGC AGATGTTATT ATCGATGATA TCATCTCAGG 60 

GCAAGCCTAC GTAGCCTTGG AAGAGGGAGA ACTGCTAGCC TATGCTGCTG TGACCAAGAG 120 

TCCAGAGGAG GCCTATGAAG CTATTTATGA GGGAAACTGG CAAGCTGGAG AGTCAGAGTA 180 

TCTAGTCTTT CACCGTATTG CTGTGGCAGC AGATGTGCAG GGAAaAGGAG TTGCTCAAAC 240 

CTTCTTAGAG GGCTTGATTG AAGGTTTTGA TTATCTTGAT TTTCGCTCAG ATACGCATGC 300 

TGAAAACAAG GTTATGCAAC ATATTTTTGA AAAACTTGGT TTTAAACAAG TCGGTAAGAT 360

GCCAGTAGAT GGCGAACGCT TGGCCTATCA AAAATTAAAG AAATAATGCA AAAGAAGTAT 420

GTAAAAATCC TCTACTCCTC ACCAATTGGT ATTCTATCAC TTGTAGCTGA TGACCATTAT 480

TTGTATGGAA TTTGGGTTCA GGAGCAGAAG CATTTTGAGA GGGGACTAGG AGATGAAACG 540 

ATAGAAGAAG TTGTwAGTCA TCCTATTTTA GACCCAGTTA TT 582 
(2) INFORMATION FOR SEQ ID NO: 279: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 554 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 279: 

CCCAAGCTAC TAAGAGACTA AAACTTGCTA GAGAAGCAAG AGAAAGTGTG AATCTTTTTA 60 

ATTTCATGAT GAATTTCCTT TCTGCTACCA ATTTAGAGAA ATTTTCTCTA ACCAGCAATT 120 

CCCCTAGTAT AACAAGTTCA AAAAATGGAG TCAATTTATC TGCTCACGGT CCAGCAGGTA 180

GCCCCGTACT TCTGAGATAA AATAGAGAGA CCCTGTAACG AACAGCAAGT CTTGAGCGTC 240

TGCCCTTTCT TCAAAATCGC TGATAAATTC TCGGTAAGAA GAAACTATAT CGTAACCTGT 300

CACATCCCTT TCGTCCAAAG CCCCCTGATA GTCAAAGCCG GTCACCTTGA GTTCCACCTG 360

AGGCAATTTT TCAGTCAGAT AACCCAACAT CCCTTGATAA TCCTTACGTT TCAAGGATCC 420

AAAGAGGATT TGAGGTCGAT AGCCTTCCTG CTCTTTTTCT TTGATAAACT CAGCCAAGCG 480

AGTCAAGGCA GGGAGGTTAT GAGCACCATC CAAATAAATC TGTGGGCGAA TACGCTCCAA 540

GCGAsCAGCC CAAT 554

(2) INFORMATION FOR SEQ ID NO: 280: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 766 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 280: 

CCGGTTTTTC AAATGAATTT CTTGGTTGTG GCTAAAAAAT ATGCTACACT ATCAATATGA 60 

AAATTTTAAT CCCAACAGCA AAAGAAATGA ACACAGACTT CCCAAGTATC GAGGCAATTC 120 

CTTTAAAACC AGAAAGTCAG GCCGTGCTTG ATGCCTTGGC TCTCTATTCT GCCAGTCAAT 180 

TGGAGAGTTT CTACAAGGTA TCAGCTGAGA AAGCGGCGGA AGAATTTCAA AATATCCAAG 240 

CTTTGAAAAG GCAAACTGCT CAACACTATC CAGCCTTGAA ACTTTTTGAT GGGCTTATGT 300 

ACCGCAACAT TAAGAGAGAT AAGCTGACCG AGGCGGAACA AGATTATCTT GAAAATCATG 360 

TTTTCATTAC CTCGGCTTTG TACGGTGTTG TTCCAGTCTT GTCACCCATG GCTCCTCACC 420 

GTTTGGATTT TTTGATGAAA TTAAAAGTCG CTGGTAAGAC TTTGAAGAGC CATTGGAAGG 480 

CAGCCTATGA TGAAACTCTG AAGAAGGAAG AAGTGATTTT CTCTCTCTTG TCATCAGAGT 540 

TTGAGACTGT ATTTTCTAAG GAAATCAGAG CAAAGATGGT GACCTTCAAA TTCATGGAGG 600 

ATAGAGGCGG TCAGCTGAAG ATTCACTCAA CTATCTCCAA GAAAGCGCGC GGGGCCTTTC 660 

TAACAGCTTT AATAGAAAAT CAAGTACAAA CTGTGGGGGa AGCACGTCGC TTGAACTTTG 720 

CTGGATTTGT TTACCGAGAA GATTTGTCAC AACCACAGGG GGATGG 766 
(2) INFORMATION FOR SEQ ID NO: 281: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 901 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 




(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 281:

CCGGCCACGG TTCCATCCAA CTTCACAGGT GTGCACTTGA TTGTGTATGT AATTGTCACT 60 

AACGGTAGAA TTTCACCTAT CCCTCCTATC TGCTCGCAGT ACCCGCAGAC TTTCTGAAAG 120 

AAGAAGATAA CCTACTTATC CGTTGCTATG ATTATACTAA AGTTTCTACT TTTTTGCAAA 180 

TAGATTTTTA AATTTTTGGC TAATTGTCTG AATCAGGGTC GGAAGTTTGA CGACCTTGTC 240 

ATTGCCTAGT TTTTCGCGTG CAATTTTGAG AATGGCACCT GAGTCTTTTG AAGCAAAGAG 300 

GAATTTTCCT TTGTCTGTAA AGACTTCGAA GTGGCGGCTG ATTTTGCGTC CAGTGACATT 360 

GGCTCCAATC TGATTGATAT GGCTCCAAGG AATCTGGATA AATTGTTCGA CATTGACATC 420 

TGGGTAAAAT TCCAAAGCCT GATCTCCGAC AAGGAATTTC CCAACTTTCC CAGCGATAGA 480 

GAGGTAGGAA GTGCCTGTCG TACTGAGGAG TACTGTTTTG TTAAGTGATT GGGCCATGCT 540 

TAGTCTTCCT TACTTTCTCC AAAAAAGGCA TTGTAGAGGG CTTTAATTGC TGCTTTCTCT 600 

TGGTCTTTAT TGACAACAAA CATAATAGAA ACTTCACTAG AACCTTGAGA CATCATCTGG 660 

ATGTTGATTT TGTTTTCAGA TAGAGCGCGT GTCGCAGTAG CAGTCACTCC GATATGGCTC 720 

TTCATTTTTT CACCAACAAT CATAATGATA GAAAGGTCGT GTTCGATTTC TGCATGATCT 780 

ACTTTAGCCT TTTGAACCAA CTGACGCAGG ATTTCTTCTT CCTTGATGGG AGTTAGTTGG 840 

CGAGAACGGA GAATGATAGA AAGAwCGTCG ATACCTGTTG GCATATGTTC CCAACCGATG 900 

T 901 
(2) INFORMATION FOR SEQ ID NO: 282: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1765 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 282: 

CCCTGTTACG TGGATAATAG GGTAAGACTG CTCAGGATTT CCTAACAAAT CCACCGCTTG 60 

CTGCATTCGA CCCAAACCTG ATCGAAAATT CAAACCAATC CGACTATGGA GCCATTCTTC 120 

TACTTCAAAC ATACACATCT CCTTGACAAA AGTCCAATCA ATTATCGCAT TAAAGTATGG 180 

TTACTAATAA AAACAAGGCC AGGATTTTCG TCCCGACCTC TTACCTGGTT AGCTAATAAC 240 

TAGCTACTAT GAATGTGAAT ATGGGCTAAA AACATCCACT GGACGTTCCA ACTCTTCCCC 300

ATTTCTGGGA GTTGGGGTAA AAATGTTCAC TGGACGTTCC AACTCTTCCC CATTTCTGGG 360

AGTTGGGCTG ATACAGTCTC CCAGACTGTA TCACTCCTCC ATAAAGCTGT TGAAGACTTC 420 

TTCAATCATG TTCCATTCGT CTTCTGAGTC TTCTGGGATT GGTTGCAATT CGCCTTCTGT 480 

TCCATCTTCG TTTTCGATGA ATGAGTAAGC TTGGATTTCA ACTTGTCCGT CTTCGTCTTC 540 

TTCTGCGTTA ACTGGTACTA GAAGAACATA GTTTTTACCA AATTCTTCTT TTCCATCAAT 600 

TGTCAAAAGG ATTTCAAACA AGGTTTCATT TCCTTGCTCA TCTACTAGTG TGATTAGTTC 660 

ACGTTCTTCG TGGTCGTGGT TATGATCGTG TGACATAGCC TCGCCTTTAT ATTAAAATTT 720 

TCTATCTAAA TAATTTTGTA AAATCAGCTG AGCTGCTAAC TTATCAATGA CTTTCTTGCG 780 

CTTATTGCGA CTGATATCTG CTTGTTCAAT CAACATGCGC TCAGCAGCCA CTGTTGTCAA 840 

GCGTTCATCC TGATAGTCTA CTGGTAAACC AAAAAACTCT TCTAGCTTTG CTCCGTAGCT 900 

TGACTAGCTT CTACGCGCGG TCCACTTGTA TTGTTCATGT TTTTAGGCAA GCCCACTACA 960 

AATCGTTCCA CCTTGTAAGT ATCAACCAAT TCCTTAACGC GGTCAAAACC AAATTGGCCT 1020 

TGTTCTTCAT TTATCTGGAT GATTTCAAGC CCTTGAGCTG TAAAACCAAG CGGATCGCTA 1080 

ATCGCCACCC CTACCGTTTT TGAACCGACG TCCAATCCCA TAATTCTCAT AGGTTATAGA 1140 

TCGACTCCTT GTCCTTTGAG GTAGTAGCGA ACCAATTCCT CAACGATTTC ATCACGCTCA 1200 

TACTTACGGA TTTGATTTCG TGCATTATTA TAACGAGGAA CGTAGGCAGG GTCTCCACTC 1260 

AATACGTAAC CTACGATTTG GTTAATTGGG TTGTAaCCCT TATCGTTCAA CGAAGCATAA 1320 

ACATCTGTCA AAGTTTCGCT AATTTCTTTT TTATTGGAAT CGTCCAATTT AAAACGTACT 1380 

GTTTCTTCAG TAAATCCCAT TCTAACACCC TCTTTCCTTA GAATAGTACC ATTATAGCAT 1440 

AATTCCTTAC CTTCTACAAT TCAGGCAGTC TATTTATTTG GATTTTCTAT TGTTCTGTCG 1500 

CGCCATTTGC CAATCTATCT GAAATATATT TGCTTGGTTC ATTTTTCAAA AGATTTTCCA 1560 

AACCAATATT CTTCAGATGT TCCAACTGGG AAGCCTTCTT GACATCCAGA ACTTGAAAAT 1620 

CAAAACTAGT CGTTGTTTGA AGTTCCGTTG CGCTCAATAG TTTTGTTTCA AGTTTGAAAC 1680 

CTGCCAATTT ACGAGCTTCA ATGATAGACT TATCCTTCTC CTCCGCTTCA AGAAGAGCTT 1740 

TTTGAGTTTC CTCCACTCCA TGTTG 1765 



(2) INFORMATION FOR SEQ ID NO: 283: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1346 base pairs 
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 283:



CTTATCCATT CACTTTCTTG TCTGTTATTC TATAAATCTT ACTCCTAAGT ATACCACATT 60

TGCCCCTAGA TGTGAACGAG AGAAACGCTC TAGACATTGC CAAGAAGGAA AAAAAAGGGT 120

ACAATGTAAC AAAATCAAGG GAGGTCTGGA ATGAAGAAAC AAAGCAAGTA CAAAGAGGTC 180

GTTTCCTATC TGAAAAATGG TATCGAGTCT GGACGATTTC CGACGGGTAG TCGCCTGCCT 240

TCTATCCGTC AACTGAGCCT TGACTTTCAC TGCAGCAAGG ACACCATTCA ACGAGCCCTG 300

CTGGAATTAC GGCACGAACA ATACCTCTAT GCCAAGCCTC AGAGTGGCTA CTATGTATTA 360

GAACAAGGGC AACATCAAGA CCTAGAAATC GAGGTTACCG ACGAACATGC CAGTGCCTAT 420

GACGATTTCC GACTCTGTGT CAATGAAACC TTGATTGGCC GAGAAAACTA CCTCTTCAAC 480

TACTATGACA ATCAAGAAGG ATTAGAAGAC CTAAGACAGT CCATTCACAA ACTCCTCTTT 540

GAGCAAGCTC TCTACTGCAA GGCTAACCAA CTAGTACTGA CTTCTGGAAC CCAACAAGCC 600

TTGTTTATCC TCTCTCAAAT ATCCTTTCCT AGACAAGCCA AGGAAATCTT GGTGGAACAG 660

CCAACCTACC ATCGGATGAA TCGCCTCTTG ATTGCACAGG GGCTGGACTA TCAAACGATT 720

GAACGAGGCA TTGATGGGAT TGACTTGGAG GAGCTGGAAG GCCACTTCAA AACAGGAAAA 780

ATTAAGTTTT TCTACACCAT TCCCCGATTT CACTATCCCC TGGGACATTC CTATTCTGAG 840

CAAGACAAAC GATCTATTCT TAACTTAGCT GCCAAGTATG ATGTCTATAT CGTAGAGGAC 900

GATTATCTGG GTGATTTGGA CTCCAAGAAG GGCCAAACCT TCCACTATCT TGATACAGAG 960

GAGCGTGTCA TTTATATCAA GTCCTTCTCG ACCAGCCTTT TTCCTGCCCT TCGTATTACA 1020

GCACTCATTC TTCCAAATGC TATCAAAGAA GCATTTGTGG CCTACAAAAA TATCCTAGAC 1080

TACGACAGCA ACCTCATTAT GCAAAAGGCC CTGTCACTCT ATATTGACAG TCAATTGTTT 1140

GAAAAAAATC GTTTGGCTCG CTTGACCAAT CATGAATCTT ACCAAAAACA AATCGAGGAA 1200

AGGATAACTA AAACACCTTG TCCCCTTCCT CATTATTCCC TACACGATGG yTTATTGCTA 1260

GACCTGAGAC AGTATCCTAA AATCGCCAGT CTCAAACACA GTCAACTGGG cTTGGACTTC 1320

TTTGAAGAGG CCTATTTAAG CACCTG 1346


(2) INFORMATION FOR SEQ ID NO: 284: 









(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 900 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 284:

CTATATTCAG AATATGCCAA AAATTCGGAA TGGTATAAAT TTGCGGAGGG TTCATTTGAC 60 

ATATTTAGAA AACTCCCCCA AAGAATTAAT TTTAAGAAAG ATTTTTCTAG AATTTTGGCC 120 

CCCTTTATTA TTAATTTGCT TAAATTAATC AATAATTATC TAGAGAATAA AGAATACGAG 180 

TGGATTGACA AGAATGGAAA TATTTTTTCC TCTCTAGTAT TTTATTTAGA AGATTTAATC 240 

TATCCTTGGA TTGTTAAACC TTTGGTTTTA GAGATAAATT CATTGCGTGA AAAAGGTTTA 300 

CTTGAAGGGG AATCGGAGCA GCAACGGTAC AAATATTTTA TAACATTGTT TGACAAGGAA 360 

GAGAATATAT TAAATTTTTA TAACAAATAT CCCGTTTTAC TGAGGCAAAT ATCGGAGTCT 420 

TGTCTTCGGT TCTATACTTA TTTTATAGAA ATTTTATCAA ATTTAGAAAA TGATTTTAGT 480 

GTGCTAGAAG AAGAATTAGG GCTAAGGGGG AAATTAAATG ATATAAAATT TGGAAAGGGT 540 

GATACACACA GCCAAGGAAA AACTGTTTTG ATACTCTTCT TTGATGACGC GAAAATTGTT 600 

TACAAGCCTA AAAATTTAAT AATCAATAAC TCACTAAATA CTATTGCTGA GTATATCCGA 660 

AAGGTTGATG AAAAAATTAG GATAAGAATA CCTCGAACTA TTGCTTATTC GGATCACAGC 720 

TATGAAGAAT TTATTGATTA TCTACCTCTA GAGCAAAAGA AAAATTTACC TGAATATTAT 780 

TATAATTTTG GTGTGCTTTT AGCATTTATA TATTTATTTA ATGGGAGTGA TATACATTTT 840 

GAAAATTTAA TTTCCTATGG AGATATGCCT GTAATAATAG ACTTTGAAAC AATGTTACGG 900 



(2) INFORMATION FOR SEQ ID NO: 285: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 862 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 285: 

TTATTTAGCA GAGGCAGTTT TAAATGTGAA GGATTTGGTC AGTCAAACAG TTTTTTATCA 60 

GCAGATTATT GGTTTAGAAA TCCTATCTCA AACGGATACA GAGGTCGTTC TGGGACTTGG 120 

AGGAAAAGCC TTGGTACACT TGATTCAAGC ACAAGAGGGT GGAGAAGTAA GGGAACATTA 180 

TGGTCTTTAC CATCTGGCTA TTCTTTTGCC GACACGAAAG GCTTTGGCGG ATGTCTTGAA 240 

GCACCTGACG GATTTACAGA TTCCTCTTGT TGGCGGTGCA GATCACGGTT ACAGTGAGGC 300 

CCTTTACTTA GAGGACTTGG AGGGAAATGG CATTGAACTC TATCGAGATA AGCCAGTTTC 360 

CACATGGGAT ATTCGAGAAG ATGGACGTAT TATCGGGGTG ACTGAAGTCC TTGCGGCTCA 420 

GGATATCTAT GAGTTGGGGG AAAGAGTAGA GCCTTTTATC CTAGCAGAGG GTACGAGAAT 480

GGGGCATATT CATCTTTCTG TCAAGGATAG TCGAAAGTCC AGACAGTTTT ATCAAACGGT 540

GTTAGGGCTC GAGGATAAAT TCAGTGTGCC TAGTGCTAGT TGGATCGCAG CTGGGGACTA 600 

CCATCATCAT TTAGCAGTCA ACGAATGGGG AGGAAAAGGT CTGGATCCGC GTAAACAAGT 660 

CCTACCAGGT TTAGCCTACT ATGTCATCGA AGTCGCACAT AAAGAAGAAC TGTTAACGAT 720 

TGCCCAACGA GCACAAGAAG TTGACGCACC AATCAAATGG ATGACATCGA TCCAATTGGA 780 

AATCACAGAC TCAGATGGCA TCGTGACCCG TATTCGTTTA GCTAGATAGA TGGTATGTGA 840 

TGAAGGTAGA GCATCAATTG TA 862 

(2) INFORMATION FOR SEQ ID NO: 286: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 650 base pairs
(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 286:

TCGTTTACAA GATCGCTAAA ATGCATCTCA TGATCGCGAC CACGAATTCC AAGATAGCAC 60 

GCGCTACCTC AATCATAGAT AGTTCACTTT TTTCTTGCCC AGCAAATACT TCTAATTCCA 120 

AAGCGTTTCT CCTCATTTAT ACTACTATCG CCAGAGCGAA CAGACTCTGA CCTCATTTTA 180 

TCATTTACTC TTTATTTTAC GATAATTTTG CGGAATAGTC AAAGGTTAAG GGGGAGAAAG 240 

TGGCAGGATT AGACTAATTC CAATATAAAA CTCATTCCTT TTTCTGTTGC TCCATTTTCC 300 

ACAAATCCAA GCGACTTGAA ACACCTCCTA GAAGCATGAT TGTAGGTGTA GATTTTCTTG 360 

ACTCTCAATT CTTTCCATCC TTTTACTCGA GCCAATTCAA TCAAAGCACT TAGAATCTTT 420 

TTTCCAAGTC CTCGATGTTG GTAAGCGGAA TTCCCAATCA CAATGGGGAG ATTATCCTGA 480 

GATAGTGTAA TATCCCCAAT TGGAAACCAT TCTCCCTTCT CCTTGACTTC AATCCAAAAA 540 

AGCTCACCAT GCCGATyCAr ATAGGAATAC ATGGCTTCCA AGGTCGcTtG ACTGTAAGGA 600 

AGCTTCACCC CATCTACGAG GtAAcCAAGT TCACATCCGT GATACCAAGC 650 
(2) INFORMATION FOR SEQ ID NO: 287: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1119 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 




(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 287:



vjA 1 AG (_ AA 1 \. UGUTTCAGAA ACTTCTCGCT TACCTCTAAC TCCGATCGCT AGTTTGGGAG 60

AAOA 1 AtrTTC CATTCTCATA CTATCTGTTG GCTTTGCAGG CTGTAAAAAC AACTTTTCTC 120

TTGCTACTTC CTGAAAATCT GAATCTTGCA GTTCTTTGCT TTCAAAATAG TCCTGTACTC 180

GC acatc AAAATTCCCA GCTAAAGACA GAGACATGTT TACAGGTTTG TAAAACTTTG 240

TAAAATTTTC TTGCAAATTA GTTAGATTGA TTTGGGAAAT GGACTCCTCA CTTCCAACTA 300

I ATCAGTTGC TAAAGGTGTA CCAGGATACA AATTCGCTAA AGTTGAAAAG AATAAACACG 360

AA1 (JTGGATC ATCTTGGTAC ATTTCTCGTT CTTGCTGAAT AATATCCTGC TCTGTCAGAA 420

rGGAAGCTTC AGTAAAGTGT GCTGATGTTA CCAATTCATC AAGTAAATCT AAATTTTCTA 480

AAAAATAATC CGTTGCTGAA AAAAGATAGT TTGTTTTTGT AAAGCTTGTA AAGGCATTAC 540

TATCTGCACC TAGACTCGTA AAAGCCGACA TCAAATCACT AGAATCTTCT CTCTCAAATA 600

ATTTATGTTC AAGAAAATGA GCAATTCCTC CAGGATATTG TTTTACATCT CCGTCAACTT 660

CTGTGACAAA CGTATCTACC GAACCAAACT GTACAGTGAC ACTCCCGTAA ACCTCTTTAA 720

ATTCCTTTTT AGGCAAAAGA GCAACTGTCA ATCCGTTGGC CAAACGAGTT CGATAAACCA 780

TTTCTTTTAC AGCTGGATAG TATTTTTCTT CAAAAACAAC CTTTGTCATT CTATTCCTTC 840

CATAAAGTAA ATCGCTTGTA GTTTCACATT ATTAGCTACT CTACAAATAG CATCTTTGTC 900

AATTTGTTCA AGCTTTGCAA TCCAACTTTT AAAGTCTGCT GAAGATTTTC CAAATAAGGC 960

ATTTTGATAA GCACGTTCAA TCAATGAAGA ATGATTATCT TGAGAAAGTA ACAACGACCA 1020

ACGAATCATT TCCTTGGTCT GATTTAACTC AAACTCTGTA AAAAAACCTT TTTTTAAATC 1080

AAGCCGTTGA TTATTCATCA ATTTACGAGC CTGGTTACG 1119



(2) INFORMATION FOR SEQ ID NO: 288: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 540 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 288: 

ACGCCCTCGC GGGGACATGA CGAATTCCCC GTTCATCACG AAGGCCGCCG AGGAGTGGGG 60 

GGTGCCGTCC AAGTCAAAAG CGGCCCCACA TCGATTCAGT TCCCCGACGA ACAGCCCTTT 120 

CCCCCAGCGT TCCTGGCTTT GCAACCGTTT CACAACAGCC TCGTAAAGTA GGCCGGACAA 180 

GGCAGACGGA CTCCAAAGGA GTTCTTCCAT CTGCAAGTGC GCCTGCGTTA TGTGATCCCG 240

GTCTTTTGCA TGTGTGTGGC ATGAATGCTG TTCCCAATCC CACTCCAGAA CATTCTCCTC 300

AAAAGTGCGC AACGTCGCCC TGAATGAATC CTGCCTTGTA GTCGTGACCA TTCCTATGAA 360 

GGGTCGCAGA GGATTTTCCC CGAGTGCAAG CGCATCCTCC GGCTCAAATC GGGTGCATTT 420 

CACAGTCCCG CTCAACGCTA GCCCGATCCC TTTTTGGCAT GGTGACTCAA GCGTCCTTTC 480 

AAACAAAAGC TCCTCATCCG CTCCAACCGG CCCGACGTAG ACGCGTAGAC CGAAGTCGTC 540 



(2) INFORMATION FOR SEQ ID NO: 289:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1949 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 289: 

AAAGAATTCG ACCAATTCAA GGTTGAGGCA TCGCAAACTA TGGACTGTTC CCCCGTCAGT 60 

TCTGGACAGA AAACGGGATA AGGTTGGCTG TGAAGCAAGC TGCCCTCCTA CCAACAATTT 120 

TGGAAAGTAG GCATCAGCTG ACAATTCTTT ACAAGCATAG TCCGTTCCAT AACCTGTTAA 180 

CAGTTGAAAG AGGAACTGGA CAAGGATATC TGAATCCGAA TAACGACAGT AGCGGCGTTG 240 

GTCATTCGTT ACTAAATACT TAGAAATCCG CTCTTTTAGT TTCAACTGGG AAAAAAGTTC 300 

CTGAAAAAAG ATAAGACCAC CATACTGGGT TAAATGACCT CCATCGAAAG ATAGTTGGTA 360 

AAAAGACTTG TTTTGGAAGT GATGATTTGG TAAACTGTTC ATGTGAGTTT CCTTTCTTTT 420 

TGTGTTTTTT TCTACACTTA TACCATAAAG GGGAAACTCT TTTTTGTCTA GTAAAAAACA 480 

CCCATTGGGT GAAAAAAGAA ACCATCCAGG ATCTAAGCTA AGGCAAGGAT TCTGGATGGT 540 

TTTTAGATTT GGGGTGAATA ATTGGGGTTT TACAATATCA ACTCCCATGA TAGTCATGAG 600 

ATGACTCTTC ACGAATTGAC GTGATGACTG TCCTTCCTTT TGCATAATTA CCTCCGAAAC 660 

ACAAAAAAAG GGGTAGACAA TCTAGTGTCT ACCCCCGAAA GTTTATTAAA ACAAAAATCC 720 

TGCCAAAGAA TTTTTGGCAG GAAACCAAAT CAATTTATCA GTTTCTATCA ATCGCTTATC 780 

GCTCTCAAAG ACTGGTAAAT AGGGATTCCG CAATCAAATT GCGATACTCT ATTATTTAAG 840 

AGTAACTGAA GCTCCAGCTT CTTCCAATTT AGCTTTGATT TCTTCAGCTT CTGCAGTTGC 900 

AACGCCTTCT TTAACAAGTG CTGGTGCACC GTCAACAAGT TCTTTAGCTT CTTTAAGACC 960 

AAGACCAGTG ATTTCACGTA CAACTTTGAT AACGCCAACT TTTTTGTCGC CTGCAGATGT 1020 

CAATTCAACG TCGAATGAAT CTTTAGCAGC ACCAGCATCA GCTGCATCAG CTGCAGCAAC 1080

AGCTACAGGA GCAGCTGCAG TTACACCAAA TTCTTCTTCG ATAGCTTTTA CAAGGTCGTT 1140

CAATTCAAGG ATTGAAGCTT CTTTAATTTC AGCAATAATG TTTTCAATGT TCAATGCCAT 1200 

TGTTATTTCC TCCAAATAAG TTTTAAATTT TATAATAGTT TTTTTCGTAG CTAGJcsTACG 1260 

CTGTGTAGCT TAAGATTAAG CCGCGTCTTC TTTGCTTTCT GCAACCGCTT TGACTGCAAG 1320 

AGCAACGTTG CGCACTGGCG CTTGAAGTAC AGAAAGGAGC ATAGAAAGAA GTCCTTCGCG 1380 

GTTTGGAAGA GTTGCAAGTG CAAGAATCTC TTCTTTAGAT GCGACAGCGC CTTCGATTGC 1440 

ACCACCTTTA ATTTCAAGTG CTTCAGCGTT TTTAGAAAAG TCGTTCAAGA TTTTCGCTGG 1500 

TGCGATAACA TCTTCATTAG AAAATGCTAC TGCAGATGGT CCAACAAATA CAGATGCAAG 1560 

ATCTTCAAGA CCAGCTTTTT CAGCTGCACG ACGCAAGATT GAGTTTTTAA TAACTTTATA 1620 

CTCAACTTCG CTTCCACGAA GCTCACGACG AAGAACTGTA TCTTGCTCAA CTGTCAAACC 1680 

ACGAGCGTCT ACAACGACGA TAGATGCAGC AGCTTTCATT TTTTCAGCTA tACGTCAACT 1740 

AGTTCCGCTT TTTTAGCAAT AATTGCTTCA CTCATTAGTG TGTTCACCTC CGTAATTATT 1800 

TTGCTTGGGG AATTTTTCAA AAAGAAAAAC GCGCCCAATC CTAGACACGA AAGTACAATA 1860 

CGCTTCTTTT TACATGATAC GTTTTGTCCT CGGTAGGATA TTTATGAGTC GAGCTCCCCT 1920 

ACTGTCTTAG GCAGTTTTTT TAGATACGG 1949 



(2) INFORMATION FOR SEQ ID NO: 290: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1023 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double
(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 290: 

GGACTGTTTG ATCTTATACA GTAGCTGCTT GATCCAAGCT TTCACCGATA GCGGCTAGGC 60 

GCTCGATAAC TTCAGCTTGT GTCAATTCAT TTTTTGAAAC ATAGCGGTTA CGTGGGTGAA 120 

CACGGCACTC GTGTGAGCAT CCACGAAGGT ACTTGTCTTC ATTTTCTTCT GATGTCAAGA 180 

TACGACGGTT ACAGAATGGA TTTCCACAGT TGACATAACG TTCACATGGT GTTCCATCAA 240 

ACCAGTCTTT CCCTACGATA GTTGGGTTGA CATGGTTGAC ATCAACGGCA ATACGCTCGT 300 

CAAAGACGTA CATTTTCCCA TCCCAAAGCT CACCTTGAAC TTCTGGGTCT TTACCGTAAG 360 

TTGCGATTCC TCCGTGCAAT TGGCCGACAT CTTTGTAGCC TTCACGGACC ATCCAGCCTG 420 

AGAATTTCTC ACAGCGAACG CCACCTGTAC AGTAAACCAC GACACGCTTG TCCATGAATT 480 

TTTCCTTGTT ATCACGGACC CATTGTGGTA ACTCACGGAA GTTGCGAATA TCTGGGCGAA 540

TAGCTCCACG GAAATGTCCT AGGTCGTACT CATAATCGTT ACGTGTGTCA AGGACAACGG 600

TATCTTTATC AAGAAGCGCT TCTTTGAACT CTTTTGGAGA CAAGTAAGCA CCTGTTGTTT 660 

CAAGTGGGTT GATGTCATTG TCAAAGTCGT TGTCTTCCAA ACCAAGGTGG ACAATTTCTT 720 

TCTTGTAGCG AACAAACATC TTCTTGAAGG CTTGTTCATT TTCTTCGTCA ATCTTGAACC 780 

AGAGTTCTTC CATTCCTGGA AGGCTGTGAA CGTAgTCCAT GTATTTTTGA GTTGTTTCAT 840 

AGTCACCTGA AACTGTTCCG TTAATTCCCT CGTCAGCGAC TAGGATACGG CCTTTAAGGn 900

CGATTGATTT ACAGAAAGCC AAGTGGTCTG CAGCAAATTG CTCTGCATTT TCAATTGGAG 960 

TATAAAGGTA GTAAAGTAAG ACACGAATAT CTTTTGkCaw AAGATTTGTA TCTCTTTATC 1020 



(2) INFORMATION FOR SEQ ID NO: 291: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3831 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 291: 



ACTATGAACA 


AGACCCAGAA 


AAAGTAGCCT 


TATTTCTTAA 


GAATTTTAAT 


AGTTTAAAGC 


60 


ACCTAGCACC 


TGTTTAGATT 


GACGAAACAG 


GATTCGATAC 


TTATTTTTAT 


CGAGAATATG 


120 


GTCGCTCATT 


AAAAGGTCAA 


TTAATAAGAG 


GCAAAGTATC 


TGGAAGAAGA 


TATCAGAGGA 


180 


TTTCTTTGGT 


TGCAGGTCTA 


ACAAATGGTG 


AATTAATCGC 


TCCAATGACT 


TACGAAGAGA 


240 


CGATGACGAG 


CGACTTTTTT 


GAAGCTTGGT 


TTCAGAATTT 


TCTCTTACCA 


ACATTAAACA 


300 


CACCATCGGT 


TATTATTATG 


GATAATGTAA 


GATTCCATAG 


AATGGGGAAG 


CTAGAACTTT 


360 


TATGCGAAGA 


GTTTGGGCAT 


AAACTTTTAC 


CTCTTCCTCC 


CTACTCGCCT 


GAGTACAATC 


420 


CTATTGAGAA 


AACATGGGCT 


CATATCAAAA 


AGCACCTCAA 


AAAGGTATTA 


CCAAGTTGCA 


480 


ATACCTTTTA 


CGAGGCTTTT 


TTATCCTGCT 


CTTGTTTCAA 


TTGACTATAT 


TAGAGGCGAG 


540 


ACATTTTTCG 


GTTCTTTGTC 


AACTGTAGTG 


GGTTGAAGAA 


AGCGAAGATC 


TAGAAAGGAC 


600 


AAATTTCGTC 


CTTTCTTTTT 


TGAAGTTTTC 


AAAGTTCCTA 


AAACCAAAGG 


CATTGTGCTT 


660 


GATAAGTTTG 


ATGAGATTAT 


TGGTGGCTTC 


CAGTTTGGCG 


TTGGAATAAG 


GTAATTGAAG 


720 


GGCGTTGACG 


ATTTTCTCTT 


TATCTTTGAG 


GAAGGTTTTA 


AACAAAGTCT 


GAAACAGAGG 


780 


TGGAAAAGCA 


AGAGCTGATA 


GAGATTATAG 


TGGTGTTTAA 


AGTCTTCGGA 


ATAGCTCAAA 


840 
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AGTTTATCTA 
TTATCACTCA 
TATTCATGGG 
GTTTATCAAA 
CCGGACTGTT 
GTGAAAATCA 
GTCGGTCATA 
AATTTGATGA 
GCACTTAAAA 
CAGAACCATA 
AAGGCGGTCT 
TGTCAACTTT 
AGTGAATGTG 
TATGTTACAA 
TGGTCCTACA 
TTATTTAGTG 
TGCTTATCGT 
TAATTTTAGT 
AGTAATTTTT 
CTTGATAAAA 
CCTGCCATGT 
GTTGTTGTAG 
GCGAATGCAG 
ATTGACGGTA 
ATATCTGTCA 
TATCTTCATG 
ATTTTGCCTT 
ATTGGTACGA 
ATTCGTATTG 
GCTGAAAAGA 



GAATTTCTTT 
GTTTCTGACT 
ATTTCGGATG 
GTCCTGAGCA 
TCAACsTCCT 
TTGTTCTTGC 
GAAGTCTTTT 
TTCTTCTTGA 
CGGCCTTTTC 
ATACCTATAT 
TTTTAGAACT 
TCCTATTTTT 
TAAAATTTTT 
AAAATTTATG 
TTTGGTGATA 
AGATATGGCA 
TTGTATCTTG 
AAATTTCCGA 
CTAAATCATT 
AGGCGATTTT 
ATGTGAGAAA 
GTACCCAATG 
AAGTGATTGC 
AGAAATTTAA 
TTGGGAATGG 
AGGAAGGTGT 
ATCATATCGA 
CAATTAAGGG 
CAGATCTTTT 
ATCGTCTTTT 



ATTAGTCAAG 
ATCTTGTTGA 
ATGGCTTGTG 
ATAAAGCTCA 
AGGACATAAT 
GAATGACAGT 
TAATTAGCTT 
CGATAGAAGT 
TAAGAAGAAT 
AAAAATATTA 
TTAATTGTTT 
ATCTTGTTGA 
TGTTAGAATA 
AGCAGATGGC 
ATTTTGACTG 
TTGGTTGTCG 
AAAAATTGGT 
ACTAATTTAC 
TTTTAATAGT 
TTATTATAAT 
GGAAGAGCCT 
GGGTGATGAA 
ACGTTACCAA 
GTTGCACTTG 
TATGGTTGTA 
AACAACTGAT 
GTTGGATCGC 
AATTGGTCCA 
AGATAAAGAT 
TGAAAAATTG 



1320 
TGCATACGAA 



ATGAGCTTCC 
TTCTGCTCTC 
TCTCCATCTC 
CTCAGGAAGA 
TGAAGTTGAA 
CTGAGCAATC 
CTCAGCGAGC 
TCTAGTTTGA 
TAGTTCTAAT 
GAAATTTAGG 
GGCTGGTATT 
AGTTTATAAA 
TAATTTCTAT 
GGAACATGTT 
TAAGGATTTT 
AATGAATCGG 
TCTTTTATGG 
TGGAAATAGC 
AAATTGTAAG 
GATGGCTCAG 
GGTAAAGGGA 
GGTGGTGATA 
ATTCCATCTG 
AATCCTAAAT 
AACTTGCGTA 
TTGCAAGAAG 
GCTTATATGG 
ATTTTCCGTG 
TATGACAGTA 



AAGTAGGGCG 
AGTAGCGCTT 
AAGAACAGTT 
CCGATTGAAA 
CGCGAAAAAT 
ATAGACAACT 
TTTTGGTTGA 
TCCATTTTTG 
ATTTTTTTAT 
AGGATTTACC 
TAGCAAATTT 
TTAACAATTC 
AAAGAAAAGG 
GATAGTATTG 
CATTTTAAAT 
ATTGTTTACC 
GGTTTTATTT 
AAAGATGATA 
AAATCTTTCT 
ATATAATTGC 
ACAAGATTAT 
AGATTACAGA 
ATGCTGGTCA 
GGATTTTCTT 
CTCTTGTAAA 
TTTCTGATCG 
AAGCTAAGGG 
ACAAGGCTGC 
AGCGTTTAGA 
AAGCGATTGT 



ATAAAATCGT 
GATAGCCTTG 
ATGATATTGA 
CAGTCACTCC 
CATGCTCAAA 
GATGATCAAT 
TGATACAAGG 
AGCAATGATA 
ACTAGAAAAT 
CAAAAGTTTT 
GTTTCTATTT 
AGGAATTGAT 
AGTATTTGAT 
AAGAAGAGTA 
TTTTAATTTA 
ATTATCGTGT 
CTTGTTGAGG 
GTAAATAGCT 
ATTGTTTCTT 
AGGTGAGAGT 
GACTTCAGTT 
CTTCCTTTCA 
CACGATTGTG 
CCCTGAAAAA 
AGAGTTGAGC 
TGCGCATGTT 
CGACAATAAG 
TCGTGTTGGA 
ACGTAACCTT 
TTTCGATGAT 



900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2580 

2640 
ATTTTTGAAG AATATTACGA ATATGGTCAA CAAATCAAGA AATACGTGAT AGATACATCT 2700

GTTATCTTGA ATGATGCGCT TGATAATGGC AAACGTGTGC TTTTTGAAGG TGCACAAGGT 2760

GTTATGCTAG ATATCGACCA AGGTACTTAT CCATTTGTTA CGTCATCAAA CCCTGTAGCT 2820

GGTGGTGTGA CAATTGGTTC TGGTGTCGGT CCAAGCAAGA TTGACAAGGT TGTAGGTGTA 2880

TGTAAAGCTT ATACGAGTCG TGTAGGAGAT GGTCCTTTCC CAACTGAGTT GTTTGATGAA 2940

GTGGGAGAAC GTATCCGTGA AGTGGGTCAT GAATATGGTA CAACAACTGG TCGTCCACGT 3000

CGTGTAGGTT GGTTTGACTC AGTTGTGATG CGTCATAGCC GTCGTGTTTC TGGTATTACT 3060

AACCTTTCTT TGAACTCTAT TGATGTTTTG AGCGGTTTGG ATACTGTGAA AATCTGTGTG 3120

GCCTATGATC TTGACGGTCA ACGTATTGAC TACTATCCAG CTAGTCTTGA ACAATTGAAA 3180

CGTTGCAAGC CTATCTATGA AGAGTTGCCA GGTTGGTCAG AAGATATTAC CGGAGTTCGC 3240

AATTTGGAAG ATCTTCCTGA GAATGCGCGT AACTATGTTC GTCGTGTGAG TGAATTGGTT 3300

GGCGTTCGTA TTTCTACTTT CTCAGTAGGT CCTGGTCGTG AACAAACAAA TATTTTAGAA 3360

AGTGTTTGGT CCTAAGAGAT TTTTAAGATT TGTTTAAGAT AGGTCGGGTA TACTATAGAC 3420

GGTTACAAGA AGACCTCCTA ACTTGTTGTA ACAAATATCC TAAACTTTTC TTTTTCATAA 3480

TAATCTCCCT ATAGAGTCAC CGCATTCGGT GGCTTTTTTT GTGTTGGGAT TCATGATATA 3540

ATAATAAAAT CGATAAGTAG GAAAAGAGAA AAGAGATGTA TTATACGCTT GAAGAAAAAG 3600

AAGTCTTTAT GAGGGAGGCT TTGAGAGAGG CTGAGATTGC TCTTGAACAC GATGAAATTC 3660

CAATTGGTTG TGTGATTGTC AAAGATGGGG AAATCATTGG TCGTGGGCAT AATGCGCGTG 3720

AGGAATTACA GCGAGCGGTT ATGCATGCGG AAATTATGGC TATAGAGGAT GCGAACTTGA 3780

GTGAGGAGAG TGCGCTTGCT GGATTGCACA CTTTTTGTGA CCATTGAACC G 3831



(2) INFORMATION FOR SEQ ID NO: 292: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1441 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 292: 

CCGCTGTTCC AACCGCAACA TACCATAGTC CGTACGGGAT TCGAACCCGT GTTACCGCCG 60 

TGAAAAGGCG GATGACTTAA CCCCTTGACC AACGGACCTG AGTTGTTATT TTCAACTCTT 120 

ACTATTATAC AGTCTTTTCA AACTTTGTCA ACTACTTTTT CTAATTTTTG TTTATTTTTT 180 
CAACTTATAG TAAAAAAAGC CAGAATTATA CTGACTCTTC TATCGCTCAT TAAACTTAGA 240

AGCACGTTCT TTTCCCCACC AATAAGGGAT TAGTTCTGCG ACTTTAACTG TTTTTCTTAT 300 

ATTATAGTCC ATCATGAATT CTGCATCTTT ATTTTCAGCA TTAAGCTCTA AAAGGAATTC 360 

TCTACAAGCA CCGCAAGGCA TGGCTGAACT TCCACCATAA GGTGGTTTGT CTCGAAAGGC 420 

TAATACTTTC TTAACCTTAG TTTGTCCTGA AAATTGGTAC ATATTGAAGA GGGCCGCCCG 480 

TTCTGCGCAG AGATGGAAAA CACCACAGGT TCCCTCCATA CAGAATCCTG TAAATATTTG 540 

TCCATCTCCT GCTTCTACTG CAGCTACAAC ATGATTGGCA TAAACAAAGT CTGATACTTC 600 

ATGTGGATTG TATAGTTTCT GTGCTTCTTC GTACATCTTT TCCCAGATGT CCATTATTGT 660 

ATCCTCTTTA TTTAGAGATT TCTTTTAGCA TGTTTTCGAT ATGCTGAATT GATTTTTCAC 720 

GTCCAAGCAA GAAAATTGTA TCTGGTAATT CTGGCCCATG CATTTCGCCT GAAACTGCGA 780 

TACGAATAGG CATGAAAAGA TTTTTCCCTT TAATACCTGT TTCTTTTTGG ACTGCTTTAA 840 

TTTGTGGGAA GATATTTTCT GTCACAAATT CATCATCTGT CATCGCTTCA AGTTTTGCTT 900 

TGAATGCTTC AAGAACTGTT GGAACTGTTT CACCCGTCAT GACTTCGCGC TCTGCTTCTG 960 

TCAATTCTGG GAAATCTGAG AAGAAAAGAT CTGTCAATGG GATAATCTCA TCTACTGATT 1020 

TCATTTGTGG TTTATAGAGC TCAACTAATT TTTCAGCCTT GTCAGTCAAA CGGCCTGCTT 1080 

CCTCTAAGAA TGGTTTTGCC ATTTCAAAGA TGGTTTCAAG GTCTGCATTC TTGATATAAT 1140 

CATTGCTCAT CCAGTCTAGT TTTTTCTGAT CAAAGGCTGC TGGTGACTTG CTGAGGCGGT 1200 

TTTCATCAAA AAGTTTAATG AATTCTTCAC GAGAGAAAAT CTCATCCCCA CCACCTGGGT 1260 

TCCAACCAAG AAGAGCAATA AAGTTAAAGA CTGCTTCTGG AAGGTAACCT TTCTTTCGGT 1320 

AATCTTCGAT AAATTGAAGT GTATTAGTAT CACGTTTAGA TAACTTCTTA CCAGTTTCAG 1380 

AGTTGATAAT CAAGTGTCAT GTGACCGAAC TCTGGAGCTT CCTCAACCTA AGAGCGGGTA 1440 

T 1441 
(2) INFORMATION FOR SEQ ID NO: 293: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4398 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 293: 

CGGCTTATGT AGTGGCAATC TTTCTACGTA AGCGAAACGA GGGGAGATTA GAGGCGCTAG 60 

AAGAAAAAAA AGAAGAACTA TACAATCTTC CAGTAAATGA TGAAGTAGAA GCTGTAAAAA 120 
ATATGCACTT GATTGGACAA AGTCAAGTGG CTTTCCGTGA ATGGAATCAA AAATGGGTCG 180

ATTTATCTCT CAACTCTTTT GCCGATATTG AAAATAATCT CTTTGAAGCA GAAGGCTATA 240 

ACCATTCATT TCGTTTTCTC AAGGCCAGTC ATCAAATTGA CCAAATTGAG AGTCAAATTA 300 

CTTTGATTGA AGAAGATATT GCGGCAATTC GCAATGCTTT GGCAGACTTA GAGAAGCAAG 360 

AATCTAAAAA TAGTGGTCGT GTTCTTCATG CTTTGGATTT ATTTGAGGAA CTTCAGCATA 420 

GAGTTGCTGA AAATTCAGAA CAGTATGGTC AAGCCTTGGA TGAAATTGAA AAACAATTAG 480 

AAAATATCCA ATCTGAATTT TCACAATTTG TAACCTTGAA TTCATCGGGT GACCCTGTGG 540 

AAGCCGCAGT GATTTTGGAT AATACAGAAA ATCACATTTT GGCCTTAAGT CATATTGTGG 600 

ATCGTGTTCC AGCCTTGGTT ACGACGCTTT CTACAGAATT GCCAGATCAA TTACAGGATT 660 

TGGAAGCCGG TTATCGTAAA CTAATTGATG CTAATTATCA TTTTGTTGAA ACGGATATTG 720 

AAGCGCGTTT CCACTTGCTT TATGAAGCAT TCAAGAAAAA CCAAGAGAAT ATTCGTCAGT 780 

TGGAATTGGA TAATGCCGAA TATGAGAATG GACAGGCACA AGAGGAAATC AATGCCTTGT 840 

ATGATATTTT TACTCGAGAA ATTGCTGCTC AGAAAGTAGT GGAAAATCTA CTTGCAACTC 900 

TTCCAACTTA TCTTCAACAT ATGAAAGAGA ATAATACTTT ATTGGGAGAA GATATTGCAC 960 

GTTTGAACAA GACCTATTTA CTTCCTGAGA CAGCTGCAAG CCATGTTCGT CGTATTCAGA 1020 

CAGAATTAGA GAGTTTTGAG GCAGCTATTG TTGAGGTAAC TTCAAATCAA GAAGAACCAA 1080 

CCCAAGCTTA TTCAGTTCTT GAAGAAAATC TTGAGGATTT ACAAACTCAA CTAAAAGATA 1140

TTGAAGATGA GCAAATTTCA GTTAGTGAGC GCCTGACACA AATTGAGAAA GATGATATTA 1200 

ATGCACGTCA AAAGGCCAAT GTTTATGTCA ATCGTCTCCA TACTATCAAG CGATACATGG 1260 

AAAAACGCAA TCTGCCAGGT ATTCCACAAA CTTTCTTGAA GTTATTCTTT ACGGCAAGCA 1320 

ATAATACCGA GGATTTAATG GTTGAGTTAG AACAAAAAAT GATTAACATT GAATCTGTTA 1380 

CCCGAGTTCT TGAAATTGCA ACGAATGATA TGGAAGCTTT AGAAACGGAA ACTTATAATA 1440 

TTGTACAATA TGCAACTTTG ACAGAGCAAC TCTTGCAATA TTCTAACCGC TATCGCTCAT 1500 

TTGATGAACG CATTCAAGAA GCATTTAACG AaGCTTTAGA TATTTTTGAA AAAGAATTTG 1560 

ATTATCACGC TTCATTTGAC AAGATTTCTC AAGCATTGGA AGTGGCAGAG CCTGGTGTAA 1620 

CCAATCGCTT TGTTACCTCA TATGAGAAAA CACGTGAAAC GATTCGTTTT TAATAAAAGA 1680 

AAAAGATTTT ATTGTGTGAG GAGCAGAATC AAATCTTTTT CTATAGTTGT GGGGAGATTT 1740 

ACTTCATTTT CTCCTGAGAT TGAGTTTTTG CCCAGCCGAT TTATCCACTA CCTCAAAACA 1800 

GTGTTTTATA CTCTTCGAAA ATCTTTTCAA ATCACGTCAG CGTCGCCTTA CCGTACTCAA 1860 
GTACAGCCTG AGGCTAGCTT CTTAGTTTGC TTTTTGATTT TCATTTAGTA TTAAAGTGAT 1920
TTCGCCAGTC TTATCTGCAG CTTCAAATCT GTACTTTGAG TAACTTGGTA ACCGTCCAAT 1980 
AACGAAGTCT ATTGAAAAAT CTCCAGACTA GAGAACTCAC GGATAGTTCC TAATCTGGAG 2040 
ATTTCTTATT TGCACTTTTC TTGTACAACT TTAGTCCACG GTAAATAGAC CTCTAAAACC 2100 
TCTTTGTTTA CGAGAGTTTC CTCGTTTGGA AGACATTCTA GAAGATAGGA TAGATATTTC 2160 
TCGCTATTTA TACTAGACTA AAATCAAAAA GCATTATATA ATAGTGATAT GAAATCAACT 2220
AAAGAAGAAA TCCAAACCAT CAAAACACTT TTAAAAGACT CTCGTACAGC TAAATATCAT 2280 
AAACGCCTTC AAATCGTTCT ATAGTAAAAT GAAATAAGAA CAGTACAAAT CGATCAGGAC 2340 
AGTCAAATTG ATTTCTAACA ATGTTTTAGA AGTAGAGGTG TACTATTCTA GTTTCAATCT 2400 
ATTATATTTC GTCTGATGGG CAAATCTTAT AAAGAGATTA TAGAACTTTT ATAGTAGATT 2460 
GAAATAAGAT GTGAACAACT CTATCAGGAA AGTCAAATTA ATTTATAGAA ATATTTTAGC 2520 
AGCCAAGGTG TACTGTTATA GATTCAATAC ACTATAGACT GTAATCAAAC AACGATTTGG 2580 
CGAAATGTAA AAAAATATGA GGAGTTCGGA CTCGACTCTC TCCTTCAAGA AACACGTGGT 2640 

GGTCGTAACC ATGCATATAT GACAGTTGAG GAAAAGAAAG TCTTTCTTGC CCGCCATTTG 2700 

AAGGCTGCAG AGGCAGGAGA ATTTGTTACA ATTGATGCCT TATTTCAGGC TTATAAAAAG 2760 

GAGTTAGGTC GTTCCTACAC ACGTGATGCC TTCTATCAAC TGTTGAAGTG CCATGGTTGG 2820 

CGAAATATTA TGCCACGTCC AGAACATCCT AAGAAAGCAG ACGCTCAAAC CATTGTCGCG 2880 

TCTAAAAATA AAATCTCAAT TCAAGAAGAA AAGAAAGCGC TTTAAAACCA GTAGACGTTT 2940 

TCGTAAGGTT CGCTTGATGT ACCAAGATGA GGCTGGTTTC GGTAGAATCA GTAAACTGGG 3000 

ATCTTGTTGG GCTCCAATAG GAGTAGGTCC ACATATCCAT AGTCACTATA TACGAGAATT 3060 

TCGCTATTGT TATGGAGCTG TTGATGCCCA TACAGGCGAA TCATTTTTCT TAATAGCTGG 3120 

TAGATGTAAT ACTGAGTGGA TGAACGCCTT TTTAGAAGAG CTTTCACAAG CTTATCCAGA 3180 

TGATTATCTT TTACTCGTTA TGGACAATGC TATATGGCAT AAATCAAGTA CCTTAAAGAT 3240 

TCCGACTAAT ATTGGTTTTA CCTTTATTCC TCCATACACA CCAGAGATGA ACCCCATTGA 3300 

ACAAGTGTGG AAAGAGATTC GTAAACGTGG ATTTAAGAAT AAAGCCTTTC AAACTTTGGA 3360 

AGATGTCATG AATCAACTCC AAGATGTTAT ACAAGGATTG GAGAAGGAGG TGATAAAGTC 3420 

CATCGTTAAT CGGAGATGGA CTAGAATGCT TTTTGAAAAC AGATGAGTAT AAAAAGAAAG 3480 

TCCTCATTTC AATAGAAATC ACGACTTTCT GATGGATTTA TAGTAAAATG AAATAAGAAC 3540 

AGGACAAATC GATCAGGACA GTCAAATCGA TTTCTAACAA TGTTTTAGAA GCAGAGGTGT 3600 

ACTATTCTAG TTTCAATCTA CTATATTTTT GGAGTGATAG AAAAGCCCTT CATAAGCTAG 3660 
TCTACTTGTT CAGGTGCGAG AGCTTTGACA TCTTTTTCTG TACTTAGCCA AGTCAGTTTT 3720
CCGTTCTCAA AGCGTTTATA TAGTAGCCAA AATCCTTGAC CATCCCAGTA AAGGGCTTTA 3780 
AAGCGGTCTT TACGTCCACC ACAAAAGAGA AAGACTTGAC CGGAGAAAGA ATCCAATTCA 3840 
AAGTGGGTTT TAACTACATA GGCTAATGAG TCTATTCCCT GCCTCATATC TGTCTTGCCA 3900 
CAAACAAGGT GAACTTGACC TAAATCACTT AGTTGAATTA TCATAGTACA ATACCTTTCC 3960 
TCCGATAATT ATTTTTTATC TAGTATACTG GAAGTTGGGG AATTAGGATA GATACCTTGT 4020
TATGACGCGC TTACGTAACT TGTAACTAGC TGCCTAGTTT GATCTTTGCT TCTTCATTGA 4080 
TTAGCAGTAG ATTTCAAAAT GATAAAAACG CATAGTATCA GGTATTGAAA TGTACTGCCC 4140 
CAAAAGTTAG ACAGAAAAAA TCTAACTTTT GGGGTGTTTT TGTTATGAAA TTAAGTTATG 4200 
ATGATAAAGT TCAGATCTAT GAACTTAGAA AACAAGGATA TAGCTTAGAG AAGCTTTCAA 4260
ATAAATTTGG GATAAATAAT TCTAATCTTA GGTATATGAT TAAATTGATT GATCGTTACG 4320
GAATAGAGTT CGTCAAAAAA GGAAAAAATC GTTACTATTT TCCTGATTTA AAACAAGAAA 4380
TGATTAATAA AGTCTTAC 4398
(2) INFORMATION FOR SEQ ID NO: 294:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 718 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 294: 

AGATTTTTAG ACTTTGTCTT TAATCGTTTC TTTTTAGGGA TGATTGCGAC ACCTTCTTTT 60 

GGCTATTAAC TTTAGCAGGA GGGATTATCC TTGGTCTAGC GCCGGCTAGT GCCACCTTGA 120 

TGAGCTTATA TGCAGAACAT GGTTATAGCT TTCGGGAATA CAGTTTGAAG GAGGCTTGGT 180 

CTCTTTACAA GCAAAATTTT GTCTCAAGCA ACCTGATTTT CTATAGCTTT TTAGGTGTGG 240 

GTCTAGTTTT GACCTATGGT TTGTATCTCT TGGTGCAATT GCCTCATCAG ACCATTGTTC 300 

ATTTGATTGC GACCCTTTTG AATGTCCTAG TAGTTGCCCT GATCTTTTTG GCTTATACAG 360 

TATCTTTAAA ATTACAAGTT TATTTTGCCT TGTCCTATCG AAATAGTCTC AAATTATCCT 420 

TGATTGGCAT CTTTATGAGT CTAGCAGCTG TGGCTAAGGT TCTCCTTGGG ACTGTGCTAC 480 

TTGTAGCAAT TGGTTATTAT ATGCCTGCCC TGCTATTTTT TGTAGGAATT GGGATGTGGC 540 

ATTTCTTTAT CAGTGATATG TTGGAACCTG TCTATGAAAT CATCCATGAA AAATTGGCGT 600 
CAAAATAGAA TGAAGCAGTT TTGGCTACAT ACGCTTCTAA GAACCTATAG TTCAGTGATG 660
ATCATTATCA TTGCGAGTTT TGCAATCTTA CTCTCTTACG CTGTCTGGGA TTCACGTG 718 
(2) INFORMATION FOR SEQ ID NO: 295: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 718 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 295:

TCGGTACCAA AATTCTGGAT TTATACTAGC AAAGATCCAA GAGCAAATTA TTTAACAGAT 60 

TTAGGTCTAG TTTTCCCTGA ATCATTAAAA GAATTTGAGA GTGAAGATAG TTTTGCAAAG 120 

GAAATTTCTG CAGAAGAAGC AAATAAGATA AATGATGCTG ATGTAATCAT AACTTATGGT 180 

GATGATAAAA CTCTTGAAGC TTTACAAAAA GATCCTCTTT TAGGTAAAAT AAATGCAATT 240 

AAAAATGGTG CCGTTGCTGT AATTCCAGAT AATACACCGT TAGCAGCCTC ATGCACTCCA 300 

ACACCACTTT CAATAAACTA TACTATTGAA GAATACCTAA ATCTTTTAGG AAATGCATGC 360 

AAAAATGCGA AATAAAAAAC AAATAAACCT AGGCATAATT TTTATAATCT GCCTAGGTCT 420 

TCTTATTACA ATATTTTTGT CATTAAAGCT TGGAACAAAA GAAATTAATA TCAGAGATTT 480 

TTTAGCAGCT TTTGGAATGG GTAATACAAA TGATGATTTT ATTAAATCAA TTATATATAA 540 

rAGAATACCT AGAACTATTT TTGCAATTTT AGCAGGTTCT AGTCTTGCCA TAAGCGGTGT 600 

ATTGATGCAA TCAGTTACTA GAAACCCAAT AGCTGATCCA GGTATACTCG GTATAAACAC 660 

AGGAGCAAGT CTTAGTGTAG TAATTGGTCC TTCtTTTTAG GGAATTCATC AAGCATAA 718 



(2) INFORMATION FOR SEQ ID NO: 296: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1436 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 296: 

GAACTAATCA TTTTTACAGG ATGAGATTTA CAGCAGAGAG TTTGAAGGCT TTATCAAAGG 60 

TTTTTCTTGG CATAATGACT TTTCCTCGTT TCCACTTAAT TTTGTGTCTA CTTTATTATA 120 

CCAAGTCCAC sCTTAAGTTA GATAATAAAT CTAACTTAAG GAAGCTAGAA GGATGAGAAT 180 

CCAGGTGGTC AAGAGTCCCA AACTTAAGCT GATGGGGACA CCCAGAATAA TTTGCTTTTT 240 
GAAGGCAAGG CCACGTTCCT CTATATTGGG AAGTGAGAGT TGAATGAGAG AACCAGCTGA 300

TGAAAAGGGT GAGATATTAG TAGATAGAGC GCCAATAACG GTGGCTGTTG TGAGTAAGTG 360

AATATCAATC TGAGGATTTT GAGCACTGAT GATAGCAATG ATGGGAAAGA GGGCTGGAGC 420

TACAACGGAT AGGGTGGAAC TAAAGAGTGA CATCACTCCG GCTATCACAC AAAAGAACAG 480

AGGTAACCAG AAATGAGGAA TGGTTGTTGT CATGAGGTGC CCTATCAGTG TGACTAAACC 540

TGACTTGACC GCTAGAGACA TTAGTAAGCT CATGCCGCAG AGCATGATAA TTGTAGCCCA 600

GGGAACCTTA GCTAAAATGG CTTCTTGCTT CCCTAATTTG AGCCTTAAGG CGAGGCAGAC 660

CATGAGTATT GAGACAAAGC CAATATCAAA TGTTTTTTGA TAAGTAGCTA TCCAGGCGAT 720

GTTTGGGAAA ATGAGATGCA ACAAGGGAAA AAGCCAAACC AAAACCATGC TGCTGATCAT 780

GAGCAAGGTG GTTTGTCTTT GAACCTTGCT GAGGAGTGGT GGTTGGTCAA TAGTCAAGGA 840

TGAGTTTGTT CTTCCCTTAC TATAGTGACT GTAACAGGAT AATAAAAGCA AGACGATGAG 900

TGGGTAGATA ATGCTGACGA TAAAGATATG ATTGCCAAGT GAAAAAGCTT GCTCTTCCCA 960

TCCCATTTGC TTAAACAGGC CTTGAAAGAC AATGCCTGAG CTACTGGTTA

CCCTCCTGAA GCTCCCCAAT TGACGGCTTG AGCTCCAATC AAAGGGTGTT TGTCCGCTTT 1080

TTGACAGAGG GTAATCGCTA GAGGACAGCA AACGGCCATA GTAGTGAAAA ATCCAGCACC 1140

TAAAGCAGAC AAAAGGGTTG CCATCAGGTA TAAAATCATG TAGAGGGCGT TAGGGTGGGT 1200

GCGTGTGCGG TAGAGAATGT GTTGAGCCAA AACATCAAGA GTACCGTTAG TTGTTGCAAC 1260

GTTATAAAAG AGAGAGACGC TAAAAATGGT AAAAAAGAGT GAGGTTGGCC AAAAATGAAG 1320

AAGTTCTTTG GGGCTTAATC CCATGAGAGT GGTTGCGATG AGGTAAGAAA AAGCAATAGC 1380

CAGCAGGCCA ATATTGATTT TGGTGCGGTA ACCAATTCCA ATGGCTAGAG CAATGG 1436


(2) INFORMATION FOR SEQ ID NO: 297: 







(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1696 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 297: 
CCATTTGGGA AAGAACGTAA GAGTTTGCAG GGTGAGATTC CAGAAGAATT TTCAATGTCA 60
GCCGTTGACA TGTCTATGAT TGACCACATT CCAGATATGA TTGAAAATGG TGTGGACAGT 120
CTAAAAATCG AAGGACGTAT GAAGTCTATT CACTACGTAT CAACAGTAAC CAACTGCTAC 180
AAGGCGGCTG TGGATGCCTA TCTTGAAAGT CCTGAAAAGT TTGAAGCTAT CAAACAAGAC 240
TTGGTGGACG AGATGTGGAA GGTTGCCCAA CGTGAACTGG CTACAGGATT TTACTATGGT 300
ACACCATCTG AAAATGAGCA GTTGTTTGGT GCTCGCCGTA AAATTCCTGA GTACAAGTTT 360
GTCGCTGAAG TGGTTTCTTA TGATGATGCG GCACAAACAG CAACAATTCG TCAACGAAAT 420
GTCATTAACG AAGGGGACCA AGTTGAGTTT TATGGTCCAG GTTTCCGTCA TTTTGAAACC 480
TATATTGAAG ATTTGCATGA TGCCAAAGGC AATAAAATCG ACCGCGCTCC AAATCCAATG 540
GAACTATTGA CTATTAAGGT GCCTCAACCC GTTCAATCAG GAGATATGGT TCGTGCATTA 600
AAAGAAGGAC TCATCAATCT TTATAAGGAA GATGGAACCA GCGTCACAGT TCGAGCTTAA 660
GAAAGGAAAA GGAAATGATA GAGGCACAGG GTTTCTTAGT GGATAAGCAA ACAAGATGCA 720
TTCATTACCA TAGCAAGCTG GATATTATTG CTTTACAATG CTATGATTGT AAAAAGTATT 780
ATGCTTGTTA TCGGTGTCAT GATTCATTAG AACATCACCC TTTTGAGCCG TATCCCTTAT 840
CTTTGATACA GGATAAGCCT ATTTTATGTG GTGTTTGTCT AAAACTACTA ACATATAAGC 900
AATATAAAGA AAGCTTAAGT TGCCCCTTTT GTTTTTCTCG CTTTAATCCA GGTTGCCAAA 960
ATCATAAGGA ACGCTATTTT AAATAGCAAA TCATCTAGTT TTGAAGTAGG AGAAAACTCA 1020
ATTTCAAGAG AAAATGAAGT AAATCTTCCC ACAATAAAAC GCATAATATC AAGATTGTTC 1080
AATACCTGAT ACTATGCGTT TTTAAGATTT TAAAGACTTT TTTCCTTTAT CTGGTATTTT 1140
GACTACTTGT TAAAACTGGG TTAATTTTCG ACTGTTTAAT AGTTATTATG CAAAGTCTAA 1200
AAGGTTAGAA TTGTCAAAAC AATCCGTCTA GAGTATGCGT GATGCCAACC GTGGTGGATG 1260
TTCTCAGTCA TGCCGTTGGA AGTACGACCT TTACGATATG CCATTTGGGA AAGAACGTAA 1320
GAGTTTGCAG GGTGAGATTC CAGAAGAATT TTCAATGTCA GCCGTTGATA TGTCTATGAT 1380
TGACCATATC TCAGATATGA TTGAAAATGG TGTGGACAGT CTAAAAATCG AAGGACGTAT 1440
GGAGTCTATT CACTATGTAT CAACAGTAAC CAACTGCTAC AAGGCGGCTG TGGATGCCTA 1500
TCTTGAAAGT CCTGAAAAGT TTGAAGCTAT CAAACAAGAC TTGGTGGACG AGATGTGGAA 1560
GGTTGCCCAA CGTGAACTGG CTACAGGATT TTACTATGGT ACACCATCTG AAAATGAGCA 1620
GTTGTTTGGT GCTCGTCGTA AAATCCCTGA GTACAAGTTT GTCGCTGAAG TGGTTTCTTA 1680
TGATGATGCG GCGGTA 1696

(2) INFORMATION FOR SEQ ID NO: 298:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1022 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 298:
CCGAGTTTAT TATGGTTTCT TCGGAATTTA TCTCAAAGAT TGAATTTGCT TGCAATAAGA 60
AAGAAAGTCT TTATAGTCAA AGCAAATTTA AGTATGCGAT TCGTTCGATG TTCGCAGGTG 120
CATTTTTAAC CTTCAGTACT GCTGCAGGTG CAGTTGGGGC TGACTTGATT AATAAAATTG 180
CACCAGGTAG TGGACGCTTC CTCTTTCCAT TCGTTTTTGC TTGGGGCTTG GCCTACATTG 240
TTTTTTTGAA TGCCGAGTTG GTCACTTCAA ACATGATGTT CTTGACTGCT GGTAGTTTCT 300
TAAAAAAAAT CTCTTGGAGA AAAACAGCTG AGATTTTACT ATACTGTACC TTGTTCAACC 360
TTATCGGAGC CTTGATAGCA GGGTGGGGCT TTGCTCATTC GGCAGCCTAT GCGAATCTGA 420
CACACGATAG TTTCATCTCA GGTGTTGTTG AGATGAAGTT AGGCCGCTCA AATGAATTGG 480
TCTTGCTTGA GGCGATTTTG GCAAATATTT TTGTAAATAT TGCGATTCTG TCATTTATTT 540
TGGTCAAAGA TGGTGGTGCC AAACTTTGGC TTGTGTTGTC AGCTATTTAC ATGTTTGTAT 600
TCTTAACAAA CGAGCACATT GCGGCGAACT TTGCTTCTTT CGCGATTGTG AAATTCAGTG 660
TTGCTGCGGA TTCAATTGCC AACTTCGGTG TTGGAAATAT GCTTCGCCAC TGGGGTGTGA 720
CTTTCATCGG AAACTTTATC GGAGGAGGCC TCTTGATGGG TCTTCCATAT GCCTTCCTCA 780
ATAAAAACGA AGATACTTAT GTAGATTAAG AAAATGAGCA CGATTGAGTC GTGCTTTTTT 840
CATTTTCAAA ATAAGGTAAT AGCTATTTCT TATATCAAAA TATAGAAAAC TGATATTTGT 900
ArACTATAAC TCAAGGTGCT ACAATATCCT TAATAAAATA ATATGGAGGT CACCTTATGA 960
CTTGTGATTT TAAATllTGAA ACTCTACAAC TACATGCTGG TCAAGTTGTG GCTCCAGCTA 1020
CT 1022

(2) INFORMATION FOR SEQ ID NO: 299: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 663 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 299: 
CCTTAAGTAA TCTCTGATAA TATTTTCTTT ATTAGCATAG GGGAATATCG ATATAATGGC 60

TTCATTATGA GTGGCAGGAA TATCCAATAT GGCAACTTTT CCAATAGATA ATTTAAAACT 120

CATTAATAAA GTTCCTTTAG GTGAAATGTC TATTTTCTTT GATTTTAATG CTAATTTAGA 180

AATAGATTCT CTCGCATTAG TTACATAACC AGATATAGGC ATATCTGATA TAGATACCCA 240

AGGTATTTCA GTTCCCCAAA AAGTAGCTTC ACTGCGTGGA GGAGTTTTTC CTATTCTGAA 300 

GTTAACTAGG CTAGCAAATT TAATATATCT CCATGCTTCT GGGATTTCAT ATATAGGATA 360 

AGAGGTTGTT TCGTCTTTGT TCCCATAATA AGAGTTATCA TCTCCTTGGG AAACAATAGA 420 

AATGTCCAAA TCTTTCTTTT TAATCTTGCC TTCTTCAAAG AGTTTTTGTT TTTCTGCTCG 480 

TATTTTTTCA AGTAAAACTT CGACTGATTC ATCATTTGGG TCTTGTTCAA CTAATTTTCC 540 

TTGCATAGCA TATTGAAGAA TAGATTTTTT TAGTTTATCT GGAAATTCTT TATCTAGCTG 600 

TTCTAGTCTA TTATAACTTT CAGCATATTC ATCTACTTTT TCTAAAGCTG ATTCGATTGC 660 

TTC 663 
(2) INFORMATION FOR SEQ ID NO: 300:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 881 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 300:

CGTCGCTGAA CATGTCAACA GCAAATTAAA CTAAACAAAC TAAAATTATG TGATACTTCA 60 

CATAATTTTC TTTAGAAAAT ATTATCAGAA GAAAGTTGAG AAAAATGGCA GAAAAAACAT 120 

ATCCTATGAC CCTTGAGGAA AAGGAAAAAC TTGAAAAAGA ATTAGAAGAA TTGAAATTGG 180 

TTCGTCGACC AGAAGTGGTA GAACGCATTA AGATTGCCCG TTCATACGGT GACCTTTCAG 240 

AAAACAGTGA GTACGAAGCA GCTAAGGATG AACAAGCCTT TGTCGAAGGA CAAATCTCTA 300 

GCTTAGAAAC AAAAATCCGC TATGCTGAAA TCGTCAATAG CGACGCAGTT GCCCAGGACG 360 

AAGTAGCGAT TGGTAAAACA GTCACCATCC AAGAAATTGG TGAGGACGAA GAAGAAGTTT 420 

ATATTATCGT AGGTTCAGCT GGTGCAGATG CCTTTGTAGG TAAGGTTTCA AATGAAAGCC 480 

CAATTGGGCA GGCCTTGATT GGCAAGAAAA CAGGTGATAC AGCAACCATT GAAACGCCTG 540 

TTGGTAGCTA TGATGTAAAA ATCTTGAAGG TTGAAAAAAC AGCCTAAAAA CAGAAAAAGG 600 

AGTGGGGAGG CGATGTGCTT CACTCACTCC TTTTTCCATT TTGCTACTCT TCGAAAATCT 660 

CTTCAAACCA CGTCAGCGTC GCCTTGCCGT ATGTATGGTT ACTGACTTTG TCAGTTTCAT 720 

CTACAACCTC AAAACAGTGT TTTGAGCTAA CTTCGTCAGT TTCATCTACA ACCTCAAAAC 780 

TATGTTTTGA GCTGACTTCG TCAGTTTCAT CTACAACCTC AAAACCATGT TTTGAGCCGA 840 

CTTCGTCAGT TTCATCTACA ACCTCAAAAC TATGTTTTGA G 881 
(2) INFORMATION FOR SEQ ID NO: 301:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 949 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 301: 



CCTTTTTTAA TACAAGTTAT TTTGATTTAA CCGGCTTGTC TTGAGCTGTC TGCAAAGCTG 60

TGGCAATCGT ATCTGCATAC AATTTTGCTC CTGCTTCGAT AGTGCTACTC TCACTCCCGA 120

AATGAACCTG GTCTGTTCCA GCCCAAATTT CTGGATGCTC TTTCGCAACT TGATTCCAAT 180

CTGCTATCGT AATGTAAGGT GTCTTCTCTG CCAATTCTCT CATATAGGCA GCAGCCTTCT 240

CAACGATGGC ATAGGTCTCT TTTGTCTTAT CTCCCTCATA AGGAGTCACC AAAATCATAT 300

GGTGTCCCTT AGGAAGATTT TTCACGATAC TGTCCCAGTC ATCCTTGTAA TTCTCAGGAT 360

TATTTACCCC AGTCGCAATG ACCACCGTCT TAGGTAAAAA TTTATTCTGG CTATTATTTA 420

GCATGATTTC ATTTGCGGTC TTGGTTGTTA CGCTGACCTG CGCGTTAATC TGTGCTCCAG 480

GAAGAGCTGT CTGTAGTGCT GTATTTGCCC TTAAAGCCAC TGAGTCACCA ATTAACATAG 540

TGCCATCAGC AATTCCCAAA CTGTTTGCAT CTGCCCGTTC TGCCATCACC TTGGTCTGGC 600

CAATATTTGT TGCAGCTTGC TTCAAGCCAT TGACAGTCAA GTCTGTCTCA AACGCTCCCA 660

CTTGTGGTGC CAACAAGGTC ACCGTGCAGA CAATGATGGT CAAGATTCCT GTACCTGCTG 720

CAAGAATTGC GTGAATATAA GGCAGGGGAC GAAsGGTTTG GACAATAGGT GTGTTCTTGC 780

CTGCAATCCA AGGTTCCAAT ACATAAAATG ACAGACTGGC AAAGCCATAA GAACAAATCA 840

GAGTCAGTAA TACAGCAAGA AGATTTGATG TCAACTGTGA GAAAATGATA TAGAAAGGCC 900

AATGGAAAAG ATAAACCGCA TAGCTAGTAT CCGCTAAAAA GCTGATAAT 949



(2) INFORMATION FOR SEQ ID NO: 302: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 622 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 302: 
AAGATATATT TTTTACACAG AAGTATGCAA AAGTAAAGAG TGCAAAAAAT GGAATTAAAG 60 
CGAAAATAAA AGCCGTGTAC AGGCGACCAA ACCAACGTAC ACGGCTAAGG AAAAATAACA 120

AAACTCAAGC AAAGGCAAGG CGCGTGGTTT TGTTAGGTAT TTAGCAAGGG GACAAACCCC 180 

TTTGTAAATA ATCTCCTCTT ATTTTATCAA AATTAGAGGA AAATGACAAC TTAATTTATA 240 

AAAAGGAAAA ATGGAGGATA TAAATGGAAA TTCTGTCTAA AGAAATACAG TTACAGGGCT 300 

TACAACTTCT TAAACAGACT CTTGAAACTT TAGTTGAGCT AGAAAAACAA CGATCTAGTA 360 

AGTTAGATTT AATTTCTCGT AAAGAATTAA TGGATCTGCT AGGTATAAGT GCTACAACCC 420 

TTGATAACTG GGAGGATCTT GGTCTTAAAC GATATCAGAC TCCGATGGAT GGAGCTAAGA 480

AAGTATTCTA TCGTCCGTCA GATGTGTATT TATTTTTAGC AATAAAATAG GAGTTATGAA 540 

ATGAAAATTG TTACTTTCAA ACCAACTAAA CAAATAGACG ATGGGTTTTA ACTGCCAGGT 600 

ATTGACATTC TATTTGTCTC AG 622 

(2) INFORMATION FOR SEQ ID NO: 303: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1929 base pairs 
(B) TYPE: nucleic acid

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 303: 



CGCTAACTTG CAAACAAAAG AAGAACGCAA ACTCCACAAA TCCTTTACGC AGAAACTCAA 60

TCTCATCTAC TTACCTTGCT GACTTGGTAG AGTATGTTGC AGACAAAGAC TTCTCAGTAA 120

ACGTAATTTC TAAATCAGGT ACAACAACTG AACCAGCGAT TGCTTTCCGT GTCTTTAAAG 180

AACTCTTGGT TAAGAAATAC GGTCAAGAAG AAGCTAACAA ACGTATCTAT GCAACAACTG 240

ACCGCCAAAA GGGTGCTGTT AAGGTTGAAG CAGACGCTAA CGGTTGGGGA ACATTTGTTG 300

TTCCAGATGA TATCGGTGGA CGCTTCTCAG TATTGACAGC CGTTGGTTTG CTTTCAATCG 360

CAGCATCAGG AGCTGACATA AAAGCTCTTA TGGAAGGTGC GAATGCAGCT CGCAAAGACT 420

ACACTTCAGA CAAAATCTCT GAAAACGAAG CTTACCAATA CGCAGCTGTT CGTAACATCC 480

TTTATCGTAA AGGCTATGCA ACTGAGATCT TGGTAAACTA TGAGCCATCA CTTCAATACT 540

TCTCAGAATG GTGGAAACAA TTGGCTGGTG AATCAGAAGG AAAAGACCAA AAAGGTATCT 600

ACCCAACTTC AGCCAACTTC TCAACTGACT TGCACTCACT TGGTCAATTT ATCCAAGAAG 660

GAACTCGTAT CATGTTTGAA ACAGTTGTCC GTGTTGACAA ACCTCGTAAA AACGTGCTTA 720

TTCCTACTTT GGAAGAAGAC CTTGACGGAC TTGGTTACCT TCAAGGAAAA GACGTTGACT 780

TTGTAAACAA AAAAGCAACT GACGGTGTTC TTCTTGCCCA CACAGATGGT GATGTACCAA 840

ACATGTATGT GACTCTTCCA GAGCAAGACG CTTTCACTCT TGGTTACACT ATCTACTTCT 900

TCGAATTGGC AATTGCCCTT TCAGGTTACT TGAATGCTAT CAACCCATTT GACCAACCAG 960 

GTGTTGAAGC TTATAAACGT AACATGTTTG CCCTTCTTGG AAAACCAGGA TTTGAAGAAT 1020 

TGAGCAAAGA ACTTAACGCA CGTCTATAAT AGAAGAAAAG AGTGGTTTGC CCACTCTTTT 1080 

TACTCTCTTT ATCCATAGAA ATTGGACTCA GCCAAGACTT GTGATATAAT ATAGAAAGCA 1140 

AAAAGGCAGA CGCCTAGATA ATAGGAGAAA CTATGTCAAA AGATATCCGC GTACGTTACG 1200 

CACCAAGTCC AACAGGACTA CTACACATCG GAAATGCTCG TACAGCATTG TTTAATTACT 1260 

TGTATGCGCG CCATCATGGT GGAACATTTC TCATCCGTAT CGAAGATACT GACCGTAAAC 1320 

GCCATGTCGA GGATGGTGAA CGTTCACAAC TTGAAAACCT TCGCTGGTTA GGCATGGATT 1380 

GGGATGAAAG TCCAGAATCA CATGAGAATT ATCGCCAGTC TGAGCGTTTG GACTTGTATC 1440 

AAAAATATAT TGACCAACTA TTAGCTGAAG GAAAAGCCTA TAAATCTTAC GTTACAGAAG 1500 

AAGAGTTGGC AGCTGAACGC GAACGCCAAG AAGTAGCTGG CGAAACACCA CGCTACATCA 1560 

ATGAATACCT TGGTATGAGT GAAGAAGAAA AAGCAGCTTA CATCGCAGAA CGTGAAGCAG 1620 

CAGGGATCAT CCCAACTGTT CGTTTGGCTG TCAATGAGTC AGGTATCTAC AAGTGGCATG 1680 

ATATGGTCAA AGGCGATATC GAATTTGAAG GTGGCAATAT CGGTGGTGAC TGGGTTATCC 1740 

AAAAGAAAGA CGGTTACCCA ACTTACAACT TTGCCGTTGT TATCGATGAC CACGATATGC 1800 

AAATCTCTCA TGTTATCCGT GGAGATGACC ATATTGCTAA TACACCAAAA CAGCTTATGG 1860 

TCTATGAAGC TCTTGGTTGG GAAGCTCCAG AGTTCGGTCA CATGACCTTG ATTATCCACT 1920 

CTGAAACTG 1929

(2) INFORMATION FOR SEQ ID NO: 304:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 708 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 304: 

AAATTTAAGA AAAAGGAGAC ACATCATGTC TAAAAAAGTA TTATTTATCG TCGGATCACT 60 

ACGTCAAGGT TCTTTCAACC ACCAAATGGC GCTCGAAGCT GAGAAAGCAC TTGCTGGTAA 120 

AGCGGAAGTT AGCTACCTTG ATTATTCAGC CCTTCCTCTC TTCAGCCAAG ATTTGGAAGT 180 

TCCAACACAT CCAGCTGTAG CTGCTGCTCG TGAAGCAGTT CTCGTTGCGG ATGCTATCTG 240 
GATTTTCTCT CCAGTCTACA ACTTCTCTAT CCCTGGTACA GTGAAAAACT TGCTTGACTG 300

GCTATCTCGT GCCCTTGACT TGTCTGATAC ACGTGGCGTT TCTGCCCTTC AAGACAAGTT 360

TGTCACAGTA TCATCTGTAG CCAATGCAGG GCACGATCAA CTTTTCGCTA TCTACAAAGA 420

CCTCTTGCCA TTTATCCGTA CACAAGGCGT TGGTGATTTC ACTGCTGCAC GTGTTAATGA 480

CTCTGCCTGG GCAsACGGAA AATTGGTTCT TGAAGAAACA GTCCTAAACT CACTTGAAAA 540

ACAAGCTCAA GACTTGGTCG AAGCTATCAA GTAACTAACA CTCAATAAAA ATCAAAAAGC 600

AAACTAkGAA GCTArCCGCA AGCTACTCaA gCACTGCTTT GAGGTTGTAG ATAGAACTGA 660

CGAGTGTnnA ACATATATAC GGTAAGGCGA CACTGACGTG GCTTGAAn 708



(2) INFORMATION FOR SEQ ID NO: 305: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 781 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 305:



CTTCTTTTCT TGGAAATAGG TGTATAATAC GTTTATTAAA TTTTTGAGGA GTTGTCTATG 60

AAGAAAAGTT TTATCCATCA acaagaagaA ATTTCCTTTG TCAAAAACAC TTTTACCCAG 120

TATTTGAAAG ATAAGCTAGA AGTTGTCGAA GTTCAAGGTC CTATCTTGAG TAAGGTCGGT 180

GACGGAATGC AGGACAACCT GTCTGGTGTG GAAAATCCAG TATCGGTCAA GGTTCTCCAA 240

ATCCCTGATG CTACTTATGA AGTGGTGCAC TCACTTGCTA AATGGAAACG CCACACCTTG 300

GCTCGTTTTG GCTTTGGTGA AGGAGAGGGT CTCTTTGTCC ACATGAAAGC CCTTCGTCCA 360

GATGAGGATT CCTTGGATGC AACCCACTCT GTTTATGTTG ACCAGTGGGA CTGGGAGAAG 420

GTTATCCCAA ATGGTAAGCG TAACATCGTT TATCTAAAAG AAACAGTTGA GAAGATTTAT 480

AAGGCTATTC GCCTGACTGA GCTAGCTGTT GAAGCCCGCT ATGACATCGA GTCTATCTTG 540

CCAAAACAAA TTACCTTTAT CCATACAGAA GAATTGGTAG AACGCTACCC AGACTTGACA 600

CCGAAAGAAC GTGAAAATGC GATTTGTAAA GAATTTGGAG CCGTCTTTTT GATTGGTATC 660

GGTGGCGAGT TGCCAGATGG TAAACCGCAC GATGGACGTG CACCAGACTA TGATGACTGG 720

ACAAGCGAGT CTGAGAATGG CTACAAGGGT CTAAATGGTG ATATTCTTGT CTGGAATGAG 780

T 781
(2) INFORMATION FOR SEQ ID NO: 306: 
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 846 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 306: 

CCCGCATCTT GTAGGGTTTT AACGGGCACG ATTTTCATAT CCGTCTTGAT TGTTTTAGCC 60 

GCTTCTAGGG CTGTTTGGTA GTTGTTTTTC GCGTCCGGAT GCGCCTTTTG TTCTTCTTCG 120

CTAACAGGGT TATCAGGAGC AAAGAAAATA GCAGCACCTG CCCTAGCCGA AGCTACAACC 180 

TTCTTATCAA TACCTCCAAT GTCTCCCACA TTACCATCGC GGTCAATGGT ACCTGTACCG 240 

GCAACAATAC GACCATTACG AAGATCTGGG TGAGCTATTT GAGTATAGAT AGCTAGACTA 300 

AACATGAGAC CAGCACTTGG ACCGCCAATA CCAGCTGTTG AAAAGCTAAT TGGGACATTG 360 

CTGATTACCT CTGTACGGTC AATCAAGCCG ATTCCAATTC CATTTTTGCC ATTTTCCAAG 420 

GTGATGATTT TTCCTTCTGC AGACTTGGTT TGCCCATCCT CTTCATAGGT GACCTTGACG 480 

GAATCCCCTA ATTTTTGAGA ACTGACGTAA TCAATCAAGT CTTTGGAACT ATCAAAGGTC 540 

TGATCATTGA CTGCTGTGAC TGTATCAGAG ATATTGAGAA TCCCTTTAAA GGTTGAATTA 600 

TCCGTCACAT TCAAAACATA AACTCCAAAG TACTTGAGTT CGATATCCTT ACCAGCTGTT 660 

TTTAGTCCTT GATACTTGGC CATATTTTGC GATGTTTGCA TGTAGAATTG ATTGATTCGC 720 

ATAAATTCAA CATCGGAAGA ACCACCTGTA GTCTCCTGAG CACTACGAAT ATCTGTAAAA 780 

GGTGTCAACC AAGCATAAAT CATATGAGCT AAAGTGGCAT GTTGAACACC AACCGTAACG 840 

AATTGT 846 



(2) INFORMATION FOR SEQ ID NO: 307: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 829 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 307: 

GCGATCTGCT TGGGCTTTTC CTATTACCTT ATCTAATAAA TAGGTACGCA GACTCATAAC 60 

CATATAAAGT CCACCCCCCA TGGCACCGAC AAGAGCTACA TAAAAGAAGC TCCACAAACG 120 

TCCACTTGGT TGGAAGAAAA ATCCTAACAG CCACTGGATG GTTCCTATTA ACAGAAACAT 180 

GACTAGGGTC AGCAAACTGA TTAAAATGGT TCGCTTCAAA ATCACCTTGC GCTTGACACC 240 
AGTTACTTTA CAAATATCCC GATACATCAA GACGTTAGGA ATGATGAGAG CAATGGTTGT 300

TGAAATCAAA GGACCATAAC TGTGGAAGAG GGCGATGGTA GGTAGTTGCA AGACTAGCTT 360 

GGCAATAGAA CCATAGATAA AATAGAGAAC GGCCTTGCGG TTGCGGAACA TGGCCTGAAG 420 

CATTGGAGAC AAGACCATGT ACAAGCCTAA AATAATAGAC TGCAAAACTG CAAAGACAAA 480 

TAAGCCCAGA GCCAAACTAT CTGGCTTACC ATAGAAGACC GTATAAAGAG GTTCTCCTAC 540 

CATAACCACT CCAACCGTTG CTGGTAGCAA GAACATAAAG AGTAGGGTGA GACTGTCCTG 600 

AACGAGACGA GAAGCTGCTT TCAAGTCCCC CTTGACATAG TTTTCCGTCA AAAGTGGCAA 660 

ACCAACACTC CCAATCGAAA CCCCTACAGA AATCAAAATC ATCGTGATTT TATTAGGATT 720 

GGCTGAGAAA TAAGAAAACA TGACAACCAA GTCCTCATTG CTGTAGTTGG TAAACCAGCT 780 

CATACTATTG ATAAAGGTCA GCTGAGTCCA AATCTGGAAG AGCTGGATG 829 



(2) INFORMATION FOR SEQ ID NO: 308: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 464 base pairs 
(B) TYPE: nucleic acid

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 308:

CGAACATCTT GCTGGCTGAT TCGTCTGCCG CCATCGCAGC CCCGAACACA TTGCGACCCA 60 

TGGCAAGCGG GCTCAATCCG CACATGGGAT CCGTGCCAAA GCCCCGCGTG TGCATCATTT 120 

GCTCATCTAG TAACGTATGA GGTTTGCCTT CGCTGTCGAT AAACCGATAT TCAATCGCAC 180 

CACTGCTCGT TCTCCGCGGA GGGGAAACCG ACTGCGGTAG GATGAACTCC AGAGAAGAGA 240 

GATCACGACC TACCAGGTGC GGCTCGTTGA AGCTGTTGCC GCTTAGCAGC AGGCTCGCCA 300 

CCACGCATTC CCAGAACTCA ACGGGGGTTT GATCGGCGTT CGGTTGCTGA CTAATAACTC 360 

GGTGCACGGG ATGCGAAGTG GCCACTTCTG GCACACCGTT CTTGTCTTCG TAGAGAGCAA 420 

TTGGGAGGGT GGCCAGCGTT TCGGCGATGA GGCGCACGCA GGCC 464 



(2) INFORMATION FOR SEQ ID NO: 309: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 982 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 309: 
CCGTCTATAA TGGTAATAGA TTTTATTTGG AGGTTTTTAT GTCATTTCTA TCAAAAAATG 60

GAGCAGGTAT CTTGGCCTGC CTTCTCATTT CCATCCTATC TTGGTACTTA GGAGGATTCT 120

TCCCTGTGGT TGGCGCGCCC GTTTTTGCCA TTTTCATAGG CATGCTCCTA CATCCCTTTC 180

TCTCGTCCTA TAAACAACTG GATGCTGGTT TGACCTTTAG TTCCAAGAAG TTGCTCCAAT 240

ATGCCGTTGT CTTGCTTGGT TTTGGTCTCA ATATCTCGCA GGTCTTCGCA GTTGGCCAAT 300

CTTCACTCCC TGTCATCCTG TCCACTATCT CAATAGCTCT GATTATTGCC TACCTCTTCC 360

AGCGTTTCTT TGCCCTGGAT ACAAAACTGG CTACCTTGGT TGGAGTAGGT TCTTCTATCT 420

GTGGGGGTTC TGCCATTGCA GCGACAGgCC CGTTATTGAT GCTAAGGAAA AGGAAGTAGC 480

CCAAGCCATT TCCGTTATCT TTTTCTTCAA TGTCTTGGCT GCGCTCATCT TTCCAACCCT 540

CGGCACCTGG CTTCATCTAT CCAATGAAGG CTTCGCCCTC TTTGCAGGGA CTGCGGTCAA 600

CGACACTTCC TCTGTAACGG CTGCCGCCAG CGCTTGGGAC AGTCTTTACC AAAGCAATAC 660

CCTCGAGTCT GCAACCATTG TTAAACTCAC ACGTACTTTG GCCATTATCC CTATCACGCT 720

CTTTCTATCC TACTGGCAAA GTCGCCAACA AGAAAACAAG CAAAGCCTGC AACTGAAAAA 780

AGTCTTCCCA CTTTTTATCC TTTACTTTAT CCTTGCCTCT CTCCTCACTA CACTACTCAC 840

CTCTCTAGGT GTGTCCAGTA GTTTCTTTAC TCCTCTCAAA GAACTCTCTA AATTCCTTAT 900

TGTCATGGAC ATGAGTGCTA TCGGTCTCAA AACCAATCTG GTCGCTATGG TCAAATCCAG 960

TGGAAAATCC ATTCATCATG GA 982



(2) INFORMATION FOR SEQ ID NO: 310: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1939 base pairs 
(B) TYPE: nucleic acid

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 310:

CTAGCTGCCA ATATGATTGG GGTGCAGAAG CGCGTGATTA TCTTTAATCT TGGCTTGGTT 60 

CCTGTGGTCA TGTTTAACCC AGTGCTTCTG TCCTTTGAAG GATCCTATGA GGCAGAAGAA 120 

GGCTGTTTGT CCTTGGTAGG TGTGAGATCA ACTAAGCGTT ATGAAACCAT AAGGCTTGCC 180 

TATCGTGACA GCAAGTGGCA GGAACAGACC ATTACCTTGA CAGGCTTCCC AGCTCAGATT 240 

TGCCAGCATG AGCTGGATCA CTTGGAAGGA CGAATCATTT AGGAGGAAAG CAAATGAAAC 300 

GAATAGTCTT TGAACTTATT TTTATCGCAA CGACCTGGTA TATCTTTTTA CCGCCCCTTA 360 
ACCTGACCAG CTGGGAATTT CTCTTCTTCC TCTGTGGGCA TTTGTTAGTT GTGGCAATAT 420

TATTTGGCTT TGGCAAGGGG ATAAACCTTG TCAAAACGGT TCATGTGCGC CACGGTAAGG 480 

CGGAAGCTGC CTTAAATCTT GAGGGTTTCA AAATCAATCG GTTAGGGAAA ATTCTGTTAG 540 

CTTCGATTGG AGGAATTCTT CTCTTGGCAG CTTTGGTTTc CTTGGTAACT TCCAGCATGT 600 

TTCAGGCTAA AAATTATGCC AATGTAGTCA CGGTTACGGA AAAAGACTTT ACTGAATTTC 660 

CTAAGAGTGA CACCAGTAAG GTTCCTATCC TAGATAGAAG TACTGCTGAA AAAATTGGAG 720 

ACCGCTACTT GGGTTCCCTA ACCGATAAGG TGTCGCAATA CGTAGCGGCA GATACCTATA 780 

CCCAATTGAC AATTGATGGG AAACCTTATC GGGTCACACC ACTAGAATAT GCAGACCCTA 840 

TCAAATGGTT TAACAATCAA GCCAAGGGAA TCGGTGAGTA TATTAAGGTG GACATGGTAA 900 

CTGGAAATGC GGATTTGGTG GACTTGAAGA CACCAATCAA GTATTCAGAC TCGGAGTATT 960 

TTAACCGTGA TGTCAAACGT CACCTGCGCT TGAAGTACCC GACCAAAATC TTTAAAACTC 1020 

CATCTTTTGA GGTGGACGAT GAGGGCAATC CTTTCTATGT AGCAACGGTT TACCAAAAGC 1080 

AATTTGGACT TGCTGTTCCT CGTCCTGCTT CAGTCATTAT CTTGGATGCT ACAAATGGAG 1140 

AAACCAAGGA ATACAGCTTA TCAGATGTTC CAGAATGGGT GGACAGGATC TATCCAGCAG 1200 

AGGAAACCAT TGAGCAAATC AACTACAACG GCAAGTACAA GGACGGTTTC TTGAATGCCA 1260 

TGATTTCCAA GAAAAACGTG ACCCAGACTA CCAATGGCTA TAATTACTTG TCTATCGGTA 1320 

ATGACATCTA TCTCTACACA GGTGTGACGT CGGCTAATGC GGATGAGAGT AATCTTGGTT 1380 

TCATCCTTGA AAATATGCGA ACAGGAGAAA TCACTAAGTA TAGCTTGGCT TCTGCGACAG 1440 

AAGAATCAGC CCGTGAATCA GCAGAAGGTG CTGTTCAGGA GAAATCCTAC AAAGCAACCT 1500 

TCCCAATCCT CATCAACCTC AATGACAAGC CTCTCTACAT CATGGGCTTG AAGGACAATG 1560 

CTGGCTTGGT CAAAGAGTAC GCCCTGGTAG ACGCAGTCGA GTACCAAAAT GTTATCGTTG 1620

CTACTACAGT GGAAGAGATG CTCAGCAAGT ATGCCAATAA AAACGACCTT GAAATTGACA 1680

ATGCAACGAC AGAAAGCATC AATGGAGTAG TAGCAGACCT CAAATCAGCT GTTATCAAGG 1740 

GAGACACTGT CTACTTCTTT AAAGTTGATG GCAACATCTA CAAGGTCAAG GCTTCAGTAT 1800 

CCGATGACCT TCCTTACCTT GAAAATGGTA AAACCTTCGA AGGTCAAGTA GGAAAAGACA 1860 

ATTATCTCAA GACCTTTAAG CTACGGTAAA AATAGGTTTT TTTCAGAAAG TATATGTTAT 1920 

AATAAGGTAA ATTAAGCCG 1939 
(2) INFORMATION FOR SEQ ID NO: 311: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 907 base pairs 

(B) TYPE: nucleic acid 




(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 311:
CCTGCTAATA GAGAGAAAGA CTAGGAGTAG AAGTAAGCCA ATTAAATAAT GAGAAAGTTT 
CATACCCCGT CCTTTCATGT AGATTTGGTA TCGAAAGATA TCTGCGGATA TAAATGTAAC 
ATTATTTTTC TAATCTGTCA ATAAAATTTC TGACAATTTA ATAAATACAA CAAGGAGAGA
GCAACAAGAC TTTCTCCTTT GTTATCCTAT TCTAAAATGT TTTTACCTTA ATCTGATAAA 
ATAATATCTT CGAGGGAGTA GCTAGCCGTC CAATCAAGAT ATTGTTTAGC TTTTGAAGCA 
TCTGCTAGGA CACTGGCTGG GTCACTAGCA CGTCGAGCAA CAATCTCGTG TGGGATTTTT 
TAATTTAGTA ATTCTTCAGC AGTTTTAAAG ATTTCTTTGA TAGTATAGCC TTTTTTAGTT 
CCTAAGTTAA AGATTTGAGA AGAACTGTCT TCTTGAAATA GGTAGTTCAT TCCTTTAACA 
TGAGCCTATG CAAGGTCCAA GACATAAATG TAATCTCGAA TACATGAACC GTCACGTGTA 
TCGTAGTCAT CTCCAAATAT TTTTAAGCTA TCATTTTGTC CCAATGCGGT CTTGTTGATA 
TTTGGAATGA TGTGAGTTGG ATTTTTCACA CGCAGACCGT TTGAAGCATC CATTTCAGCC 
CCAGCAACAT TAAAGTAACG GAAAATAACA TATTTCCAGT CGTAGCGATT GGCCATCCAG 
TAAATCATTC GTTCGCCCAT CAGTTTTGTC TCTGCATAAG GGTTGACAGG GTCGAGCAGG 
GTATCTTCAG TCACCGGCTT GTCAATACAG TTATTTCCAT AGAGAGAAGC AGTCGAAGAG 
AACATGATTT TTTGAATGCC AACTTCAGAT AAGACTTTGA GAACTTGGTT CATACCAGCA 
ACGTTGG 

(2) INFORMATION FOR SEQ ID NO: 312: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2170 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 312: 
CCACATAAAG GTAAATATCT TTTGTACTAT CTTGGGCATC CAAGAAAAGC AATTGGGCAA 60
TAACAGAGTT AGCCATATTG TCTTCAACCG GACCTGTCAG CATAATGATG CGGTCTTTGA 120
GAAGACGTGA GTAAATATCG TAAGAACGTT CTCCACGGCT TGTTTGTTCA ATAACTACAG 180
GAATCATTCA TTTCTCCTTT TGAGTTTTAA TTTTGTTGGT CAAATGACTG AAGATAAGAC 240
TATTATAATA TCTTGGTCAA AAAAGGTCAA ATTTTTGCTC TGCTTTCATT AGACAGAAAC 300
AAAAACCCAA CCTCCTTTCG TGACTGGAAA TACTTTTCCA AGTCATTCTT CTTTTCGATC 360
TTATTTTGTA CCGAACAAGC GGTCTCCAGC ATCTCCAAGA CCTGGAACGA TATAACCGTG 420
TTCGTTCAAA CGTTCATCCA AGGCTGCTGT AAAGATTTCT ACATCTGGAT GAGCTTCTTG 480
AAGGGCTTTT ACACCCTCTG GAGCAGATAC AAGGCAGACA AATTTGATAT TTGATGCGCC 540
ACGTTTTTTA AGAGAATCAA CAGCCAAGAT TGCTGAGCCA CCTGTTGCCA ACATTGGGTC 600
TACTACAAAA ATTTGACGTT GGTCAATGTC CTCAGGCAAT TTCACCAAGT ATTCAACTGG 660
TTGAAGTGTT TCTTCATCAC GGTACATACC GATGTGGCCA ACTTTAGCAG CTGGAACCAA 720
GTTCAAGAGA CCATCAACCA TCCCGATACC TGCACGCAAG ATTGGGACGA TGGCCAATTT 780
CTTACCTGCC AATTGTTTTT GAACTGTTTT TGTAATTGGT GTTTCGATTT CCACATCTTC 840
TAGTGGAAGA TCACGAAGTA CTTCATACCC CATCAACATT GCAATCTCAT CTACTAGCTC 900
ACGAAAAGCT TTTGTAGAAG TATCTGTACG ACGCAAGATT GACAATTTGT GTTGAATCAG 960
TGGGTGATTA ATAACTTCAA TTTTTCCCAT TTTTGGAATT CCTTCTTTCA ATTTATTCTT 1020
CTTATTATAC CAAAAAACGG TTTAAAAATC TTTCTAAACC ATTTATTTTT GATAATTTTT 1080
ACATTAGATC AGCCTCTTTA AGAGCTGTCT GTACTGTCTC AAGTGGTAAA TGGGTCAATT 1140
CTGTCCCTTT TTCTTGATAA AGGTATTGGG CGTAGTCGTC CATTCGGTAC TGGTTGATAT 1200
AAACCACGCG CTTGCAGCCG ACCTGAAGCA ATTGTTTTGT ACAGTTGAGA CAAGGAAAAT 1260
GGGTTACATA GGCTGTAAAG CCTTTGGGAA CACCACGCTC AGCACCTTGA AGGATAGCAT 1320
TGACCTCAGC GTGAAGGGTG CGAACGCAGT GGCCTTCAAT GACCAAACAT TCGTGATCAA 1380
TACAATGCTC AGTCCCTGAC ACCGAACCAT TGTAACCAGT GGAAATAACC TTATTATCTT 1440
TTACCAGAAT CGCGCCCACT TTAGCACGTT TACAAGTGGA ACGATTCGCA ATTAGTAGAG 1500
CTTGGGCTGC AAAATACTCA TCCCAGGCCA GTCTTTTTTC AGTCATCTCT TTTCTCCTTT 1560
TTCTCTATTT TTTAAAAAAT GGTAAACCTA AATCTGCAAT CTTTTCAGCT GGTACCTTCA 1620
TGCCATCCTT GATCCATTTT AGAAGGACAG AGACGATGGC TGAGCTCCAG AAGGAATGAA 1680
GATAAGAGCT GACACCTTTT GATTTCCCAT GGTATTTTTC TAGAAATTCC TGCATGGCTT 1740
GGACAAAGAT TTTTTCCAGA TGGTAATCCA AGGCCAATTG AATTACTCTA GCTTCCTTTC 1800
TGGCCTCCCG GAAAAGGTGA ACCCAAACCA AATAAAGGTC TGTCTTTAAA TCGTAATGAT 1860
GCAGCTGTTC CATAATATTG TGGACAGTTC GTTTAAAGAC GCTCTCTAAA ATTTCCTCTT 1920
TGGAGTCATA ATTGCGATAA AAGGCCGCAC GCGAAACACC TGCACGTTTG ACCAATTCAG 1980
AAATACTAAT CTTGGTCAGT TCCTTTTTTT CCAAGAGTTG CAAGAGGGCT GTTTCAATGG 2040
CTTCTCTGGT TAATAAATTG GATTCTTGGT TTGATTTTCT GAGATTTTCA AGAGACTTTT 2100
CAGAGATTCT ACGTTCAGAC ATAACATTTT CTTTCTACTT GTCACAACAG ACGGATGATG 2160
CTTTTGTTTC 2170

(2) INFORMATION FOR SEQ ID NO: 313: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 539 base pairs 
(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 313: 
ATCTGCACGA ATCAGGGCTT TCTAAGTGAC TATTTCCACC GAAATATTAT TTATATCAGG 
AGGACATTCA TATGTCACGT TATACAGGAC CATCTTGGAA ACAAGCTCGT CGTCTTGGCC 
TTTCACTTAC AGGTACAGGT AAAGAATTGG CACGTCGTAA CTACGTACCA GGACAACACG 
GACCAAACAA CCGTTCTAAA TTGTCAGAAT ACGGTTTGCA ATTGGCTGAA AAACAAAAAC 
TTCGTTTCAC TTACGGTGTA GGTGAAAAAC AATTCCGTAA CTTGTTCGTA CAAGCTACAA 
AAATCAAAGG CGGAATCCTA GGTTTCAACT TTATGCTTCT TTTGGAACGT CGTTTGGATA 
ACGTTGTTTA CCGTCTTGGT CTCGCGACTA CTCGTCGTCA AGCTCGTCAA TTCGTAAACC 
ACGGTCACAT CCTTGTTGAC GGGAAACGCG TTGATATCCC ATCATrCCGC GTAACTCCAG 
GTCAAGTGAT CTCAGTTCGT GAAArATCAT TGAAAGTTCC AGCAATCCTT GAAGCAGTA 
(2) INFORMATION FOR SEQ ID NO: 314: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 667 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 314: 
CCGGTTTTGC TCCTTCTCTA CGGCTACGAC GTGATGTATC TCTGATGATA TCCACTGTTT 
CTGTAGCAGG CGTAGGTGTT TCTGGACCTG CTTGTTCTGC TTTTTTCTCT GCCGTCGTAT 
AGGAAACAGC TACCCTTGTT GGGGTTTCAT TGTATTCTCT TTCAAGTTTC TTAGGTCTAA 
CAGGACCTGG ACCTGGTCTT GATCCACTTT CTTCCGCTGG AGAAGAAGGT ACATCTTGAC 
TTGGATGACT TGGAACACCA GGAGTTTCTC TTTGAATCTC ATCTGCTGGA GAAGCTGGTA 
CACCTTGACT TGGGTGAGTA GGCACGGTAG GAGCTTTTCT CATAATCTCC TCTACCGTTG 360

ACAAGGAATC AGCCATGAGT TCTTCAGTTG AAGGTTCATT TGCAGGAGTG CGAACTACTG 420 

CCTCATCTTC TTTCAGAACT TCATCATAGC CTTTTACTTT TTCTAAATCT CTCAGAATCT 480 

GCTCTTTAAA GCGTAATTTC TCTTCTGCTC TTGACTTTTC ACTCAAAAGT TTTTCCTCCT 540 

TGTTGAGAAT CCATAATATT AGAGCTGAGA AGTCCAAAAA AAGCAATCTA TGATACTTTT 600 

CCTAACGGAT TTTGTCATTT CCCAGACCAT ATCATACCAT GTTTCCCCTG CAAAGGTTGA 660 

CTGGGAA 667

(2) INFORMATION FOR SEQ ID NO: 315: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1483 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 315: 

GGGAAGCCAA GGTATTTTAT CGGATGAAGT TGTTACTAGT TCTTCACCGA TGGCTACAAA 60 

AGAGTCTTCT AATGCAATTA CTAATGATTT AGATAATTCA CCAACTGTTA ATCAGAATCG 120 

TTCTGCTGAA ATGATTGCCT CTAATTCAAC CACTAATGGT TTAGATAATT CGTTAAGTGT 180 

TAATAGTATC AGCTCTAATG GTACTATTCG TTCCAATTCA CAATTAGACA ACAGAACAGT 240 

TGAATCTACA GTAACATCTA CTAATGAAAA TAAGAGTTAT AAGGAAGATG TTATAAGTGA 300 

CAGAATTATC AAAAAAGAAT TTGAAGATAC TGCTTTAAGT GTAAAAGATT ATGGTGCGGT 360 

AGGTGATGGG ATTCATGATG ATCGACAAGC AATTCAAGAT GCAATAGATG CTGCAGCTCA 420 

AGGGCTAGGT GGAGGAAATG TATATTTTCC TGAAGGAACT TATTTAGTAA AAGAAATTGT 480 

TTTTTTAAAA AGTCATACAC ACTTAGAATT GAATGAGAAA GCTACAATTC TAAATGGTAT 540 

AAATATTAAG AATCACCCTT CCATTGTTTT TATGACAGGT TTATTTACGG ATGATGGTGC 600 

GCAAGTAGAA TGGGGCCCAA CAGAAGATAT TAGTTATTCT GGTGGTACGA TTGATATGAA 660 

CGGTGCTTTG AATGAAGAAG GAACTAAAGC AAAAAATCTA CCACTTATAA ATTCTTCAGG 720 

TGCATTTGCT ATTGGGAATT CAAATAACGT AACTATAAAA AATGTAACAT TCAAGGATAG 780 

TTATCAAGGG CATGCTATTC AAATTGCAGG TTCGAAAAAT GTATTAGTTG ATAATTCTCG 840 

TTTTCTTGGG CAAGCCTTAC CCAAAACGAT GAAGGATGGG CAAATCATAA GTAAGGAGAG 900 

CATTCAGATT GAACCATTAA CTAGAAAAGG TTTTCCTTAT GCCTTGAATG ATGATGGGAA 960

AAAATCTGAA AATGTGACTA TTCAAAATTC CTATTTTGGC AAAAGTGATA AATCTGGGGA 1020 
ATTAGTAACA GCAATTGGCA CACACTATCA AACATTGTCG ACACAGAACC CCTCTAATAT 1080

TAAAATTCAA AATAATCATT TTGATAACAT GATGTATGCA GGTGTACGTT TTACAGGATT 1140 

CACTGATGTA TTAATCAAAG GAAATCGCTT TGATAAGAAA GTTAAAGGAG AGAGTGTACA 1200 

TTATCGAGAA AGCGGAGCAG CTTTAGTAAA TGCTTATAGC TATAAAAACA CTAAAGACCT 1260 

ATTAGATTTA AATAAACAGG TGGTTATCGC CGAAAATATA TTTAATATTG CCGATCCTAA 1320 

AACAAAAGCG ATACGAGTTG CAAAAGATAG TGCAGAaTwT TTAGGAAAAG TATCAGATAT 1380 

TACTGTAACA AAAAATGTAA TTAATAATAA TTCTAAGGAA ACAGAACAAC CAAATATTGA 1440 

ATTATTACGA GTTAGTGATA ATTTAGTAGT CTCAGAGAAT AGT 1483

(2) INFORMATION FOR SEQ ID NO: 316: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2453 base pairs 
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 316: 
CCTGAACGCT TTTTTATAAA TATCATAAAG CCAATCTGAT TTATCAAGTG TGTCTAAGCG 60 

ACGCGAATTA AAATTCATTG CATACTCCAT CGCTTCTAAA AAACTCATTT TTGAAAAGAC 120 

GTTAAAATCA TCTAAATTCT GACTCCAATA TAATAACAAA ACCAATCCCA TAATATCCTC 180 

TGGTTGATTA TTCAATAAAT TTAAGTTGGT TTCATAAAAC CCTGGAGTTC CAAATAGAGG 240 

CAACTTTTTT TCTTCAATTT GAGTTTCTTT CCTTAGGGCA TGCTCAAAGT CTATAATATA 300 

AATATTATTT CTATTATCAA TAAGTATATT ATTAAATGAT AAATCTCTAT AGGAAAGATT 360 

ATATTTGGAG TTTATTATCT CCATATAATC AATTAATGTT AAAAACCAAT CATACGAGCC 420 

ACTAACCATA TTATACTCGC TTAATTTATC TGCAATAATA AACTCAAATT CCACAAAATA 480 

CGAATTCTTT ATGTAAAAAT CGTTAAAAAC TTTTGGAGTA AATTCCTCCT TTTCCAATTC 540 

TACTAATATT TCTCTTTCAT TTATTAAACG ATTCACAGAA TCTCTATTTG TAAAATCAAC 600 

CAACGATAAA TCACTAGCTT CTTTTAATAA AGAATAAACT CGCTTTTGAG TATTAAATAC 660 

TTTATAAACT CCACCTTTGG CATTTTTAGA AATCACTTCC AAAATAATAT ATTGATCAGG 720 

AATAGTGTTA TATCTTGGAA TATAGTAATC CCTTATTGGA ACATTCACAT TTGAAGGGAT 780 

TTTCTTATCT CTTTTATCCT TGAAAGTGCT ATCTTTTACG AACTCCCCAT ATCTGTAATA 840 

TACAACCTCG CTAAGTTGAA ATCTGAAATC TGATGGTATG TTTACACCCT TTACACCTTT 900 
ATACAATATT [illegible] TTATTATCTT TTGGATAAAT 960

TGTAATGAAT [illegible] GTATTTTGCA AAGAAAGTTC 1020

TTTAATGCTA [illegible] CTTCTTCTCT CTAGAAAATA TAAAATCAAA 1080

GAATTTTTTA [illegible] CTCAGGTGTA TTTTAAATCC 1140

[illegible] ATATAACCAA TGTTCATCAC TAAAATTATC 1200

[illegible] TATTCTAATA ATAAATTATG GTATGCGTCT TCTATTTCAG TTTCATAGTC 1260

[illegible] CGTAATTCAT ATTAAGAAAT CTTCTCCATA AATTTTTAGA 1320

[illegible] ATTTAAAGCG TGATAATAAA TGTTGATAAT CAATGTAACT 1380

[illegible] TATTTTGTAA TTCCTTCACC AATAATTTTA TGCTATATCT ATTTTCTCGA 1440

[illegible] AGGACTTCAA GATAAAACCA TAAAAGAGAT AAGTATTATA ATCTGACAAT 1500

[illegible] AATAATTTTT TAGAAAAATA TCTAGTGATT CTGATAATTC ATCCGGAATA 1560

ATTCTTTTAA CATCGTATTT ATTTTTCATA TCGGCCACTC TTCCTTAAAA AGCTCACAAT 1620

[illegible] ATTTCTATAC AACAATCCGA GAGTAGTCTC ACAATTTGAA CATTTCACAT 1680

[illegible] ATATAAAAAA TGAATTAATC AGAAACCTCT GACTAAGATT TCCTAATTAA 1740


TTCACTTTCT ATATCATAGT AAGGAATTCT ATTATCCCTA ATTGAAAATT GAAATTTTAT 1800

GTTTTATATA TTAACAATTA TGCGGATTGT AAATCTTGTC TAACAAAATG GCAAGTGCTA 1860

CTATGTGCCC CAGAAGGCGA TGCAACGCTA TTTTGAATTG AAAGAGCATA ATCATCCATA 1920

TCATTTAAGT CACGGATTAG CAATGCTTCC TTCTCTCTTC CGACAATTCC AAATTTTCTA 1980

ATTACCTTTT CAGGATTATC AAAAAATTCT CCAACAACTT CCATATTTCC TTGAAGTTCA 2040

TTCAAGAAAG CTTTCATTTG ACTACTCATT ATATAGCTCC TTTTCTATTA CTTTATTTGG 2100

AATCAAAACT TACTTGTACA TTGGAAACAC CTCTATTCTA CGCTTTCATA TTGCTGCATG 2160

ACACTTTCAA AATCAAATTG CTAAAAATAA TTTTTTAAAG CTTAATTTAG ATTTAATTAC 2220

ATATATCTCA AAAAATTGTT TTGAAATTAG TAAATTAAAA TAGGTTTCTG TACTTATAGG 2280

AACTAGTTAT AAAAACTTCG CCCATCATAA AATATCTATT TAAGTAAAAC AAAAATTTTA 2340

TAATTTTTTG ATTTTTAAGT GACTATAATC TCCTATCTAT AAATACCATT CGCAGGACCT 2400

GGATCAATCC CTCTAGCCAT CTTATGAACT TGAGTTCCTC CAGACAGTCC CGG 2453


(2) INFORMATION FOR SEQ ID NO: 317:









(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1049 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 317:



CCAATTTGAA GGCTCTAAAA CAATGGAAAA GTGCTACACA GATGTGACAG AATTTGCCAT 60

TCCAGCAgTA CTCAAAAACT TTACTTATCA CCAGTTTTAG ATGGCTTTAA CAGCGAAATT 120

ATTGCTTTTA ATCTTTCTTG TTCGCCTAAT TTAGAATAAG TACAAACAAT GTTGGAACAG 180

GCATTCAAAG AGAAGCACTA TGAGAATACG ATTCTCCATA GTGACCAAGG CTGGCAATAC 240

CAACACGATT CTTATCATCG GTTCCTAGAG AGTAAGGGAA TTCAAGCATC CATGTCACGC 300

AAGGGCAACA GCCCAGACAA CGGCATGATG GAATCTTTCT TTGGCATTTT GAAATCGGAG 360

ATGTTTTATG GTTATGAGAA GAACTTTAGA TCTTTAGAAA ACCTTGAACA AGCTATTGTG 420

GACTACATTG ATTATTACAA CAACAAGAGA ATTAAGGTAA AGCTAAAAGG ACTTAGCCCT 480

GTGCAATACA GAACTAAATC CTTCGGATAA ATTAATTGTC TAACTTTTGG GGTGCAGTAC 540

ATTTTTGGTA TATATAAAAT TTGTAGGAGC TATATCTACA ATTTTATATT CCCAGTTTAT 600

GGATGTAACT TACTATATTC ACAATGTTAT CCAGTGTTTT TTCTCTAATA TTTAAGGAGT 660

GTTCTGTTTC TCGAATAAAT TCTTCAAAGT TTAACCCGTC AACTTGTTCC TGAACAAGAA 720

AATAATCATC CACGATATAA AATTCATCAG TTAAATTAGT AGTATAACTT TTATCGGCTA 780

ATTTTTTTAG CATGTGAGCT TCATTTTTTA TATCATCAAG AGCTGTCCAT TCTCCTTCAG 840

CATCATAATT CACAAAAGGT CTTGACTGCT TGATGATTAC TTTTTGCCCG TCCGATTTTC 900

TAATTGCCCG ATAAACATTT CCTTTATTTG ATCTCTTAAT AATTTTTTCC ATTTTGTATT 960

TATTTATTGC AGAGTCCTTA CTTGAAACTT CACATGTGGT TTGAAAATAA ATCCTTTTTT 1020

CTTCTTCTGA AAATAAATCC ATTTTCCGG 1049



(2) INFORMATION FOR SEQ ID NO: 318: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 776 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 318: 

TTAGTTGGTT AGAATCAGAA AATCGCCGAA GTGGTTATTT ATTTTTGAAT AAATTTAACG 60 

AACCAATTAC AGCAAGAGGA GTTGCTCAAC AGTTAAAAAA TTATGCTGAT AAATACAAAA 120 

TGAATCCTAA AGTAATTTAC CCTCATTCTT TTAGGCATTT ATTTGCTAAG AATTTTTTAG 180 
CGAAGTATAA TGATATTGCC TTGCTTGCAG ATTTGATGGG ACACGAAAGT ATAGAAACTA 240

CTCGAATTTA TCTAAGGAAA ACAGCTACTG AACAACAAAA TATTGTAGAT AAAATTGTTA 300 

ATTGGTAAAA AATAACAGGT GGTCAAACTG ACTACCTGCT ATTTTTGTGA TTATGGCTCT 360 

TATTATGGGA ATATACCTAT GAATTGGGTT GTTATAAAAA TAAAAGATAT TTTTTCAATA 420 

AATACAGGTC TTTCTTACAA GAAGGGCGAT TTAAGCATTA ATAATAAAGG TGTTAGAATT 480 

ATACGTGGTG GTAATATTAA GCCTTTAGAA TTTTCTCTGT TGGATAATGA TTACTACATT 540 

GATACACAAT TCATCTCCTC TGAGCAAGTT TATTTAAAAC ATAATCAGCT AATAACACCT 600 

GTATCAACCT CTTTAGAACA TATTGGAAAG TTTGCAAGAA TCGAGAAAGA CTATGATGGT 660 

GTTGTGGCTG GTGGATGTAT TTTCCAATTA ACACCATTCG AAAGTGCAGA GATGATGTCA 720 

AAATGTCTAT TATGTAACTT GTCCTCTCCG TTATTTTATA AACAATTGAA AGCAAT 776 
(2) INFORMATION FOR SEQ ID NO: 319: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 658 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double
(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 319: 

TGCAATGCGG CGGCTGCATA CGCTTGATCC GGCTACCTGC CCATTCGACC ACCAAGCGAA 60 

ACATCGCATC GAGCGAGCAC GTACTCGGAT GGAAGCCGGT CTTGTCGATC AGGATGATCT 120 

GGACGAAGAG CATCAGGGGC TCGCGCCACC GAACTGTTCG CCAGGCTCAA GGCGCGCATG 180 

CCCGACGGCG AGGATCTCGT CGTGACCCAT GGCGATGCCT GCTTGCCGAA TATCATGGTG 240 

GAAAATGGCC GCTTTTCTGG ATTCATCGAC TGTGGCCGGC TGGGTGTGGC GGACCGCTAT 300 

CAGGACATAG CGTTGGCTAC CCGTGATATT GCTGAAGAGC TTGGCGGCGA ATGGGCTGAC 360 

CGCTTCCTCG TGCTTTACGG TATCGCCGCT CCCGATTCGC AGCGCATCGC CTTCTATCGC 420 

CTTCTTGACG AGTTCTTCTG AGCGGGACTC TGGGGTTCGA TGTCGACAGC CCGCCTAATG 480 

AGCGGGCTTT TTTTTCCTGA GGCTGGACGA CCTCGCGGAG TTCTACCGGC AGTGCAAATC 540 

CGTCGGCATC CAGGAAACCA GCAGCGGCTA TCCGCGCATC CATGCCCCCG AACTGCAGGA 600 

GTGGGGAGGC ACGATGGCCG CTTTGGTCCC GGATCAATTC GCGCGACCGG ATCGATCC 658 
(2) INFORMATION FOR SEQ ID NO: 320:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1475 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 320:



CCGGCTTAAT TTTTAGAAAA CGTGGGCAGG GAACCTTTGT TCTCTCTCGT GGCAGCTCAA 60

AAAGAAAATT AATCGTTCCA GAAAGAGATA TCCGGGGACT GACAAAAATA TCTGAAGATG 120

CTCATTCTAC AATTGACTCG AGGATTATTC ACTTCAAATT AGAATTTGCA AATGAATTTT 180

TAGCAGAAAA ACTACAGGTC GCTTTGCAGA GTCCAGTTTA TAATATTTAC CGCCTGCGTA 240

TTATTGACGG TAAACCTTAT GTTCTGGAAC AAACTTATAT GAGTACCGAT GTTATTCCAG 300

GTATTACTGA AGATATTTTA CAAAAATCGA TTTACAATTA CATTGAAGGA AAGTTAGGAT 360

TGCATATTGC CAGTGCTACA AAAATCTTAC GAGCTTCTTC TAGTTCAGAA AATGAGCAAC 420

ATTACTTGCA GCTCCTTCCA ACGGAACCGG TATTTGAAGT AGAACAAGTG GCTTATTTGG 480

ATAACGGAAC TCCGTTTGAG TACTCGATTA GTCGTCATCG CTATGATTTA TTTGAATTTA 540

ATTCTTTTGC ATTACGACAT TCCTCCTAGG AGAAAATGTG AAAATGAAGC CAATCTTTTA 600

CAGACTCTAG TTTAAGAAAA ATTTAAAACA GGGCAAGAAG GTCCCATCTA TGCTTAAATG 660

GTTTCTCTTT TCTAAATAAG ATGGCTTTAA AAGAGTGATC GTTGTATCCA TCATGTTGAA 720

AAATATCTTC GTATAGCTTA TAGAGTAGGT ACTGAAATTG TTCACCTGAT CTACTTCTTA 780

TAGTTATTTA GTTTTAAATA GTGTTTCAAA CATTCTTACA CTGACGAGAA GTTTTTGAGT 840

CTTTTCTTGT AACACATATA GTATACTGTG GTTAGAATAG TAGACTGTGA CTTCTAACAA 900

ATTGCTAGAA ATGAATTTCA ATCTCCCAAT TTATTTGTTC ATATCTTCTT TTAATATATT 960

AAATAAATTC TAAATCATAA TCATTTAAAA AAATTTTATT TTTTATTTTT CATTACGAAT 1020

AATATAGATG AAGGGGAAAG AGTATGAAAA CAGAACTGTT TCTTTTGCTA TTAGTTCAAA 1080

AGGAGAAAAA ATGAAAGTAG AAAATATTTC GTATAGGGTG GATCATCGTA AATTGTTTGA 1140

TAATATTTCT TTTGATACTT CGAGTTCAGA CGTGACATTA ATTACTGGTA AAAATGGTAC 1200

AGGAAAGTCA ACTTTACTAT AGTAGATTGA AACTAGAATA GTACACATCT ACTTCTAAAA 1260

TATTGTTAGA AATCGATTTG ACTATCCTGA TCTATTTGTC CTGTTCTTAT TTCATTTCAC 1320

TATATCTCAA ATTGAGTATG ACGAAGTGCG CTCCCATGTC CTGGGAACGC ACTTTCTTCA 1380

TATTTTTCAT ATTCTTGAAT CCATCGATAA AGACTATTGG GATGAATTTT TAAAGTTGAA 1440

CTAATCATTT TTACAGGATG AGATTTACAG CAGAG 1475


(2) INFORMATION FOR SEQ ID NO: 321: 








(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 560 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 321:

GAAATATATA TACTTCATCT TAATAGTGAG CAAGCTAAAC TTAGCATTTC ATGCCCTCAT 

ATGGGATGTT CTTTGACTAA ATAATATGAT TATCGAGATA TATCTGGATA AATGAACTAA 

TAAGTCTGAC GCGTAGACTT ATCAAAGTCA TTGGCATACA CCACTATGAA CTCGTTGGTC 

TGTTCAAATC CCAACACATT ACCTGAGAAG AAAGTTGCAA TGTTGTTTTT GGTGCGGGTT 

TGAATTTAAA AAATTTGTTA TGTAGTACCT AATCTAAGGA ATTAGAACAA TGCCTCTAAT 

TTTTCTTTAA TACACTGAAA CATTGATGAT TCTGGCTGTA TTTTTGAAAC AGCTCTTCTT 

TGCTCCTGGA AAATATCTTC AGAAGTTATA TTCTCTATTC CTAACGCTAC TTGAGTTTTT 

TTTCTAAAAT ATTCTTTTCC GTTGCCATCT TTAGAAAAAT CATAACCTTC CCTATCTACG 

CTGTTACACA AATTAGCTAA AAAArACTCT GGGGTTGGGA AAGGAAGATA AGAAaCGTAT 

TTAGCCCATA ATCTATAAAG 

(2) INFORMATION FOR SEQ ID NO: 322: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 643 base pairs 
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 322: 
CCGCCCGGCC ACCGCTGCCT ATCCTCGGGA GAGGGTCACC TGGAGTGAAC CTAGAACGAT 
AGACACGGTG CGGTACGACC TCGTACTACT TTCGCCGACG GCCTCGTCCG TTGTCATCCA 
CGAACTGATC GGACATGGGT GCGAACACTT CAGAGAAAAA ATCGTTGGAC TGCGTGTCGG 
GCCTGAGGAA CTACGGGTGG TGGCTTTTCC GAAGAACGGC TCCGGGTTTG ATGACGAGGG 
TACACCCTCC GAAGAGATTG TACTTGTGGA GAACGGCATT GTGAGGCACG CTGTCAGGGA 
TCGGGCGACT GGAGGAATGG CGCCTTTTTC CGGTTTGACC AAAGTGGCAT CACATGGTGT 
CAAACCTGGC TCAAGATGTA CGCATCTCAA GGCGGAAGGG GAATCGTCAC AGGAAGGAGT 
TACCGGAGTA CCCGCCGAAC GCACCGTTTG GATAGAGCAT TTTTCTGCAG CGAACTACCA 
TTCAGGTCGA GCCTTTTTCA GGTCTGGCCT TGCCTGGGTA GGCAGCCGAG AAGAACTCTT 
ATATCCCTTA ATGCCTTTCA CCATGTCAAT TGATATCTAC GAACTGGCCA GCTTATTGTG 600
GCATTTAGAC GGTCAAACGG AACGAGCACG TAGGGTACTG TGC 643 
(2) INFORMATION FOR SEQ ID NO: 323: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 780 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 323: 

GGTACCCACT CATTCTTGAT GAATTGTGAA CAGTTGCCCT TGGGTCGTTT TGCGAGTTGA 60 

AGTCAAGAAG AGGAAAAAAA CAAAAAGGAG AAATACTCAT GGCAGTAATT TCAATGAAAC 120 

AACTTCTTGA GGCTGGTGTA CACTTTGGTC ACCAAACTCG TCGCTGGAAT CCTAAGATGG 180 

CTAAGTACAT CTTTACTGAA CGTAACGGAA TCCACGTTAT CGACTTGCAA CAAACTGTAA 240 

AATACGCTGA CCAAGCATAC GACTTCATGC GTGATGCAGC AGCTAACGAT GCAGTTGTAT 300 

TGTTCGTTGG TACTAAGAAA CAAGCAGCTG ATGCAGTTGC TGAAGAAGCA GTACGTTCAG 360 

GTCAATACTT CATCAACCAC CGTTGGTTGG GTGGAACTCT TACAAACTGG GGAACAATCC 420 

AAAAACGTAT CGCTCGTTTG AAAGAAATTA AACGTATGGA AGAAGATGGA ACTTTCGAAG 480 

TTCTTCCTAA GAAAGAAGTT GCACTTCTTA ACAAACAACG TGCGCGTCTT GAAAAATTCT 540 

TGGGCGGTAT CGAAGATATG CCTCGTATCC CAGATGTGAT GTACGTAtTG ACCCACATAA 600 

AGAGCAAATC GCTGTTAAAG AAGCTAAAAA ATTGGGAATC CCAGTTGTAG CGATGGTTGA 660 

CACCAATACT GATCCAGATG ATATCGATGT AATCATCCCA GCTAACGATG ACGCTATCCG 720 

TGCTGTTAAA TTGATCACAG CTAAATTGGC TGACGCTATT ATCGAAGGAC GTCAAGGTGT 780 
(2) INFORMATION FOR SEQ ID NO: 324: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 624 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 324: 
CGGGAAAAAT CAGATTGTGG GTTCAGATAT CGAATTAGCC AAGGCTATCG CAACAAAACT 60 
AGGTGTCGAA TTGGAACTAT CTCCCATGAG TTTTGATAAT GTACTGGCTA GTGTTCAATC 120 




AGGAAAAGCC GACCTTGCCA TATCAGGTGT TTCTAAGACA GATGAACGGA GCAAGGTGTT 180 

TGACTTTTCC ATTCCCTACT ATACTGCAAA AAATAAACTC ATTGTCAAAA AATCTGACTT 240 

GACTACTTAT CAGTCTGTAA ACGACTTGGC GCAGAAAAAG GTTGGAGCGC AGAAAGGTTC 300 

GATTCAAGAG ACGATGGCGA AAGATTTGCT ACAAAATTCT TCCCTCGTAT CTCTGCCTAA 360 

AAATGGGAAT TTAATCACAG ATTTAAAATC AGGACAAGTG GATGCCGTTA TCTTTGAAGA 420 

ACCTGTTTCC AAGGGATTTG TGGAAAATAA TCCTGATTTA GCAATCGCAG ACCTCAATTT 480 

TGAAAAAGAG CAAGATGATT CCTACGCGGT AGCCATgAAA AAAGATAGCA AGAAATTGAA 540 

AGAGGCAGTT CGATAAAACC ATTCAAAAGT TGAAGGAGTC TGGGGAATTA GACAAACTCA 600 

TTGAGGAAGC CTTATAAGCA TCCA 624 



(2) INFORMATION FOR SEQ ID NO: 325: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1237 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 325:



TCTTATGAAG CCGAAGCGTG ATTTATGGCG GATAGGTTTG GTCTGCAGAA AGTGACAAAT 60

CTAGTGCCAT CAGCGTATAT GGAATCTnTG GCTGAGAAAC AGTCCCGGGG TGAACTGACT 120

TATGAGCAGG TTTATGAGGA TGCAACGGCT TATCATCATA CCATTGATGC GAGTACAGAG 180

GAGGCAGACT TGGTTTCTCT ACGTATTGTA GAACTATTGT CTCGAAGAGG CTTTAGCTTC 240

AGTCCTGCGA TCTTACTTGC TATTCATAAG GAGTTGTTTC AAGATATATT TGAACCCTCG 300

ATTCCGGTAG GTCAATTTCG TCAGACTAAT ATCACAAAGA ATGAACCTGT TTTGAATGGT 360

GAAAGTGTTG TGTACTCTGA TTACTCCATG ATTCAAATGA CCTTGGATTA TGATTTTAAT 420

CAGGAAAAAC AAGTTGCATA TGCGACACTA ACCCAGGCGG ATATGGTTAA AAAAATCCAG 480

CATTTTATTT CAGGAATCTG GCAGATTCAT CCATTTCGCG AAGGAAACAC TCGGACGGTA 540

ACGGTATTTT TGATTCAGTA TCTTCGTGAG TTTGGTTTTG ATATTGATAA TACACCATTT 600

CAGCAACATT CCAAGTATTT TCGTGATGCC TTAGTGTTAG ATAATGCAAA GATTTTACAG 660

CGACGTCCTG AGTTTTTAAC AGCTTTTTTT GAAAATCTCT TGCTCGGTGG TCAAAATGAT 720

TTGTCTTCAG AAAAAATGTA TCTAGATTTA GACCTCGATC TTTCATAATC CTAATACTGA 780

GTAAACATTG AATTTTAGGA AAAAATGAAG TAAATATTCT CACAAGAAAA CGTATATCAT 840

CAAAGTTTGG CTCTTTGTCA ATTGTAGTGG GTTGAAGAAA AGCTAAGTTC GAGAAAGGGC 900

AAATTTCGGC CTTTCCTTTT TGATGTTCAG AGCGATAAAA ATCCGGTTTT TTGAAGTTTT 960

CAAAGTTTCG AAAACCAAAG GCATTGCGCT TGATAAGTTT GATGAGATTA TTGGGCGCTT 1020 

CCAGTTTGGC ATTAGAATAG TGTAGTTGAA GGGCGTTGAT AACCTTTTCT TTATCTTTGA 1080 

GGAAGGGTTT AAAGACAGTC TGAAAAATAG GATGAACCTG CTTAAGATTG TCCTCGATAA 1140 

GTTCGAAAAA TTTCTCCGGG TCCTTATTCT GAAAGTGAAA CAGCAAGAGT TTGAAGAGCC 1200 



GATAGTGATG TATCAAGTCT TGTGAATAGC TCAAAAG 1237
(2) INFORMATION FOR SEQ ID NO: 326: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 461 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double
(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 326: 

TTTGATTTTT CTGAATTAGA AGAGATTGAA TTGCCTGCAT CTCTAGAATA TATTGGAACA 60 

AGTGCATTTT CTTTTAGTCA AAAATTGAAA AAGCTAACCT TTTCCTCAAG TTCAAAATTA 120 

GAATTAATAT CACATGAGGC TTTTGCTAAT TTATCAAATT TAGAGAAACT AACATTACCA 180 

AAATCGGTTA AAACATTAGG AAGTAATCTA TTTAGACTCA CTACTAGCTT AAAACATGTT 240 

GATGTTGAAG AAGGAAATGA ATCGTTTGCC TCAGTTGATG GTGTTTTGTT TTCAAAAGAT 300 

AAAACCCAAT TAATTTATTA TCCAAGTCAA AAAAATGACG AAAGTTATAA AACGCCTAAG 360 

GAGACAAAAG AACTTGCATC ATATTCGTTT AATAAAAATT CTTACTTGAA AAAACTCGAA 420 

TTGAATGAAG GTTTAGAAAA AATCGGTACT TTTGCATTTG C 461 



(2) INFORMATION FOR SEQ ID NO: 327: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1436 base pairs 
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 327: 

TAACATTTAG GTACCTCTTC TTAACAAAGT TCAATAGTAA CAATTAATAT TTTAAACAAT 60 

ATATCAAACA TCAATGACTA GAATACTTGC ATCATCCTTC TTTCCATAGA TTGGATCAAT 120 

AGCAGAAGAA TTAAATCTCA TCTTAATTAA CTCTTCAAAA GTTTTATTTT GATTATTTTG 180 
ATAGAATTCA TAAAAGCCAT CGCTCATTAA AACAATTTGT TCACTAGTAA CATCTATTTG 240

ATTAATAATA GCATGGTCTA AAAATCTCTC ATCCAACGAA CCTATCCAGT ACCCACTCGG 300 

TTGATTAGAT AATTTTCTGA TTTTTTGTAA AATAATTTTT TTATTTAAAA CACTATTTGT 360 

ACCAATTGAA TCTTTTATCT CATTTTTCCC TTTTTCAAAT AAGTTATCTA CTCTATGATC 420 

AGTTATTTCC ATTTCGTTTA CTAACATGAC GCAGTCACCT AGCATCATAT ACTCCAACTT 480 

TTTTTCTGAA AGTTTAGCAA ATATTGGTAA GCGATAATAT AGTATATTGA AACTAGAATA 540 

GTACACCTCT ACTTCTAAAA CATTGTTAGA AATCGATTTG ACTGTCCTGA TTGATTTGTC 600 

CTATTATTAT TTCATTTTAC TATACTCTGT TAATTTATAT GAGTTTAAAC CGATTTCATC 660 

TTTAACCTCG AGTAAAGCAG TTTCAAATAT TTGTTTAAGA GTTTTTGATT CTTTACAATT 720 

AACCGACAAA CTTTCTGATA AAATATGTAC AACTTCTGAG ACTGAATAAC CTATCTCCTC 780 

TTTAGAATTA TATAAATCTG TAGCTCCACC AATAATCCAA AAATACTGAT TTTGTGAACC 840 

TACAATATCC TCATTTTCTA CGGAACTTCC TTGTATCGAA CAAATTTTAT TTATCTTTAC 900 

CATAATACTT CAACCCTTTT AGTGTCAAAA GTAAACCAAT TCCTGTCACT GTTAAGAATA 960 

GTTCCATAAT CTTATTCGAA CCAGTCTTTG GTAATTTTTG TTTkACATCT ACTATyTCTT 1020 

TAGATTTATT AATATGATTT TCAGTTTCTC TGCCATCTCC AACTATTTTA TAGTTTACTT 1080 

CTTCTGTCTT ATTATCTTGT TTATTGTCGA TCTTGTCATT CATTTGTCTA TTATCTTTAC 1140 

TTGAGTTAAA CTCTCCGTTC TTCTGGTTAC TATCAATTAC ATTATTTGAA TTAGATTGTT 1200 

TTTCCTCTTT GTTTTTTTCT TTTTCGTTTT TATCACTTAA ATTATTTGTT ACAATTTTGT 1260 

AAAGCCCATT CTCCGTTACA ATATTGAAAT TACCATCGCT ATCACGTATA ACAGGTTCTT 1320 

TCCCATTTGC ATTAGATTTG ATGAATGATA TATACTTACC GGATAAATTA TAAAATTGGT 1380 

TATTTAAAAC GGTTATTTTA CCCTTTGAAT CCTCAATAAC AATTCCTTCT TTACCC 1436 



(2) INFORMATION FOR SEQ ID NO: 328:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 646 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 328: 

CCGGCAGACA GGAGAAGGTG TTAAATATCA ATCTCAAATG GTTCGTCAAT GGTTTCTGAT 60 

ACGTATTTTC CGTCTTTCTT CCGTTGCTTG ACACACTCTG TGAGGAGATA TTCGATTTGC 120 

CCATTGACTG AACGAAAGTC GTCTTCTGCC CATGATGCGA GTGCAGCGTA TAACTTTGTT 180 
GAGAGTCGAA GGGGGATCTG CTTTTTTTTA GCTTCAGCCA TCTTTAGTAA AGGCTTCCTG 240

TGTTGACAAT TGGTTGTGCA TCATGATTGC CACAAAGAAC GACAAGGAGA TTTGAAACCA 300 

TGGCAGCTTT TCGTTCTTCG TCAAGTTCTA CCAATTCCCC TTCATTGAGC CGTTCTAGTG 360 

CCATTTCAAC CATTCCTACA GCACCATCTA CAATCATCTT CCGTGCATCA ATAATGGCAG 420 

ATGCTTGTTG GCGTTGAAGC ATAACGGCAG CAATTTCTGG AGCATAAGCT AGGTAAGTGA 480 

TACGTGCTTC AAGGATTTCC AAGCCAGCAT CCTCAACACG ACTTTGGATT TCTTCACGAA 540

TACGGGTAGC AACAATTTCG CTAGAGCCAC GGAGACTACC TTCATCTGCG TGCCCATCAC 600 

CCGGAGTATC CACATTAGGA GACACATCGT AAGGATAGAT GCGGAC 646 
(2) INFORMATION FOR SEQ ID NO: 329: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1653 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 329:
GTTGCAGGTG CAGTAGGTGT TACTTCAGAT ACATTTGAAC GTGCAGAGGC TCTTTTTGAG 60 

GCAGGAGCGG ATGCGATTGT TATTGATACT GCACATGGTC ATTCTGCAGG TGTCTTGCGT 120 

AAAATTGCCG AGATTCGTGC TCATTTCCCA GATCGGACTT TGATTGCTGG AAATATTGCT 180 

ACTGCTGAAG GTGCACGTGC CCTTTATGAA GCGGGTGTAG ACGTTGTTAA GGTTGGTATT 240 

GGACCAGGTT CTATCTGTAC TACTCGTGTG ATTGCTGGTG TTGGTGTTCC GCAAGTAACA 300 

GCTATCTACG ATGCTGCAGC TGTTGCGCGC GAATATGGTA AAACGATTAT TGCTGACGGT 360 

GGGATCAAGT ATTCTGGAGA TATTGTAAAA GCACTTGCTG CAGGTGGAAA TGCTGTTATG 420 

CTTGGATCTA TGTTTGCTGG AACTGATGAA GCTCCAGGCG AAACTGAAAT CTTCCAAGGA 480 

CGTAAATTCA AGACTTACCG TGGTATGGGA TCAATTGCTG CTATGAAGAA AGGTTCAAGC 540 

GACCGTTATT TCCAAGGTTC TGTCAATGAA GCAAACAAGC TTGTTCCAGA AGGAATTGAA 600 

GGTCGTGTTG CTTATAAAGG AGCGGCAGCT GATATTGTTT TCCAAATGAT TGGTGGTATT 660 

CGCTCTGGTA TGGGTTACTG TGGTGCAGCT AACCTTAAAG AACTACACGA TAATGCTCAA 720 

TTTATTGAAA TGTCTGGTGC TGGTTTGAAA GAAAGCCATC CTCATGATGT GCAAATTACT 780 

AATGAGGCAC CAAATTATTC TATGTAAAAA ACAATGAAAA GAACTCCAGT GAAAACAGGA 840 

GTTCTTTTAC AATGTTGTCA ATTTCCATTT ACAGCAGCTT TACCATCCTG AATAGTGAAG 900 
ATACTTAGAT TTTCTGGCAG ATTTTGAAGA TGGTCTAAGC TTGTTGTTGT GATAAAGGTT 960

TGGATTGATT GAGAAATCGT TTCTAATAAT TTTAACTGTC TAGTGTTGTC AAGTTCACTC 1020 

ATCACATCGT CAAGCAGTAA TATAGGAGAT TCTGTGGTAA TGCTTTCCAT TAATTCGATT 1080 

TCTGCTAATT TTATCGAGAG GACGAGACTA CGATGTTGAC CTTGGCTTCC GAAACTAGCA 1140 

TCCATCCCAT TTATATAAAA AGAAATGTCA TCTCGATGAG GACCGACACC AGTATTCTTT 1200 

TTAAATAAAT CTCTGGATCT ACTTTTTTCT AAAGCAATTT TGAAAGATTC GGATAAGTTT 1260 

TGTTTGTCAG TTATATTGAC AGAAGATTGA TAGGATATTG ACAACTCTTC GATCTGATTA 1320 

GAGAGTTCAA AATGTTTCTT ACGCCCAAAT GATTCTAGTT TTTTTATGAA ATCTAAGCGG 1380 

TGATTCATTA CACGACATCC ATAATCAACT AGCTGATCAT CTAACACAGA AAGGAATGTT 1440 

TCATCTATTT TTTGAGCTGA TTTTAGGTAA GTGTTTCTTT GCTTTAGGAT GTGGTTATAA 1500 

TTGGTTAAGT CAGATAAATA GATTGGCTTA ATTTGCCCAA GTTCCATATC AATGAATTTT 1560 

CGTCGAATCG AAGGTGCTCC TTTAATTAGT TGTAAATCTT CAGGAGCAAA TAAGACAACA 1620 

TTCATGTGTC CTACATAATC TGAAAGGCGT GCC 1653 



(2) INFORMATION FOR SEQ ID NO: 330: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1340 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 330: 

GAAACACTGT ATTTCAAAGC ATTTTTTGTT AGTTTAAAAT TACTCCCATT CTTCTTTTCC 60 

AAACGTACAA TATATCCAAA ACCATTCAAA ATACTAGATT CTATTTTTTA TAATATCACT 120 

AAATCCACCT AATTATAGGA CGTTTTCAGA TTTTTAGTCC CAGTCCCAGT ACCGGAGAAA 180 

TATTGTTTTA ATATAATATC TCTTTTTGTC TTCTAAGCTC TTAAAAGCAA AAGAACAAGT 240 

AAAGAGTCAA GACAAGGATA AAAAGTCCAT ATTAGGGCAA ATAAAAAGCT TTAAGACAGA 300 

TGACAAATCT AAGTCAAATA AGAAAGACCA TAGCAAAGGT GCAGAGAGAT AAATATTGGC 360 

GGTCTTCGGA CTGCCTTTAT TTTTTTATCC ATTTTTCAAA TCAAATTTAT TCAGACTATA 420 

TATGCACATA TACACTTAAA TTCATATAAA AACATGGCTT GTAAAAAATT ACTTTAATCA 480 

CAATAATCGC ATTTAAAATT GTGATGTTTG CAAGCTAAAT TACGGACTTC ACTTGGAAGT 540 

TTTCCCTTGT ATCTTTTATA ATAGATAGAA AATTTGCTGG CAGATGAATA TCCAACAGAT 600 

TCTGCTATCT CTTTTATAGG TAGTTCAGTG TTTAAAAGAA GAGTTTCAGC TACATTCATT 660 
CTTTTTCTTT GAGTGTACTC TGTAATGCTT TGACAATATT TTTCCTTAAA TAAATTTTTT 720

AATTTAGTAC CACTCATTTT AGATATTTTT TCAAGCGTGC CTTGATTTAC ATTCGTTGCA 780 

AAATGATCAT CTAAGAATCT TGCTACATCT TCAAGTGCTT TATCATCATC AATTTCAATT 840 

TTATATTTTT TTCTATTTAA GTATGTGTCA ATTACTATAC TTATCCATTC ATTTGCCTTT 900 

GCTTTAAAGA AAAAATCAGC GGCAGGAGCG TCCATCTTAC AATTTAATAT TTCCATTGCC 960 

ACTCTTTCTA AGGCCTTTGT AAGTATTATT TGATTCGGTT GAAGCAAGGT TGAATAAAAA 1020 

GATTCTGGAT TAATGTTAAT AGATGCTAAA TGTTTTTCTA TTAGCTCTTT TTTAAAACCm 1080 

ATGGAAACAG CAAGATAACA ACAATTCTCG TGTAATAAAA AAACAAAATT ATCTTTTATA 1140 

TTATCAAAAT CAAAAGTACA TAGAGAGTTT GCGGTAATAG TTTGATACGG ATTAAACTTT 1200 

TCTCCGTTTG CACTGACAAT GTAACTTGAA TAAATTGAAA CATAGTCTGA CATACTATAA 1260 

GTGCTATTTT GAACTACTTC CTCTTTGATA TAAAAATCAT GTATATCGAT AATGAAGATG 1320 

CCTCCTTCAT AAAACCGGTA 1340 



(2) INFORMATION FOR SEQ ID NO: 331: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 607 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 331: 

TATGTTCGTG ATGAGTTTTT AAGTAGGAAA AACGTGCTAA CCTCTCAGAT TTTGGAACTT 60 

GTAAAAGAAA CTCTTTTTTC ACCCGTAGTA GTTGATAATG GGTTTGATCC GGCCTTATTT 120 

GAAATTGAGA AAAAACAATT GCTAGCAAGT TTAGCAGCTG ATATGGATGA TTCTTTTTAT 180 

TTTGCACATA AAGAATTGGA TAAATTGTTT TTTCATGATG AACGTCTTCA ATTGGAATAT 240 

AGTGATTTAC GAAATCGTAT TTTAGCTGAA ACTCCACAAA GTTCTTATTC TTGTTTCCAA 300 

GAATTTTTAG CCAATGATCG AATAGATTTC TTTTTCCTAG GTGATTTTAA TGAGGTTGAA 360 

ATTCAAAATG TATTAGAATC ATTTGGCTTT AAAGGTCGAA AAGGAGATGT GAAGGTTCAG 420 

TATTGTCAAC CTTATTCTAA TATCCTTCAG GAAGGTATGG TTCGGAAAAA TGTGGGACAA 480 

TCCATTTTGG AATTAGGTTA TCATTACTGT TCTAAATATG GTGATGAGCA ACATTTACCC 540 

ATGGATTGAA TGAATGGTTT ACTTGGTGGA TTTGCTCACT CTAAGCTCTT TACAAATGTC 600 

CGGGAAA 607 
(2) INFORMATION FOR SEQ ID NO: 332:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 900 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 332: 



TTAAAATACC GAATTTTGTT TTGTCCTCTA TTTCAACATT GTGAATCGCC TCAGGCAGAG 60

AACCGATACT AAAGATATAA CCAAAATAGT TGTCATTTGC TTTACCGATA TCAATCTTAT 120

TGGTTAAATC AAAATCCAGT TCGTCAATTG CGCCATCGAT GTCTTGATTG ATTTCCAAAA 180

GTTTTGTAAT GAGGTTACCC GTACCGCCTG GGATAATCCC TAACTTAGGA ATGTAGTCTC 240

TCTCATCAAT ACCTGAAATG ACTTCATTGA CAGTTCCATC TCCACCAAAC ACAACCACTG 300

CATCATACTG CTCACGAGAA GCTTCTTCAG CAAAATGTGT TGCATCCAGC GCTTTTTCGG 360

TAATTTTGGT TTCAACATAT TCAAAGTATT CTTTTGCTTT ATTCTCCAGC TTTTCTTTGT 420

AATCCAAAGC CTTCTCGCCA CCAGAAGTAG GGTTGATAAT TACCATTGCT TTTTTCATTG 480

ATTTTATCCT TAATTTTAAA CAGAAATGTT TACATTTCGT CGTATGCAAG TAAATGTAAT 540

CCTATTATAC AATGAAAATA CAGAAAAGAG AAATCTGACG TACTGGAGAT TAATACGCTT 600

TTATTCTATT TTCCCATCGC CTAACTACAT CCTTTAAGGG TTCATCCAAG TAAGAATAGG 660

CCTTATCCTT GATCCAATCA GGAATACCGT AAGCTGCCTC TGCTAwGCTA CAAGTGATTG 720

CTGCGAGAGT ATCACTGTCG CCACCAAGTG AGATGGCATT TCTTATCGCA TCTTCGAAGT 780

CTCTACTTTC AAGAAAGGCG ATAATGGCTT GAGGGACAGT TTCCTGACAT GTTTCGTTAA 840

AACGATAGTT AGGACGGATT TCATCTAAAG TTTGAGATAG ATTGTAATCG TATTCTTTTT 900



(2) INFORMATION FOR SEQ ID NO: 333: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 533 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 333: 

CCTTTCTGGC ACACTGGTCT TGGAATACGG CAAAACCTCT GAAAATATCT ATGCTGGAAT 60 

GGACGAGGAA TACCGTCGTT ATCAGCCTGC CATCATCACT TGGTACGAAA CAGCCAAACA 120 

TGCTTTTGAT CGCGGACAGA TTGGCAAAAT ATGGGTGGAA TCGAAAACGA CCTCAAGGGC 180 
GGTCTCTACA GCTTTAAATC CAAGTTCAAT CCGACCATTG AGGAATTCGC TGGTGAGTTC 240

AACCTGCCAA CTAATCCTCT TTACCACCTC TCCAATCTGG CCTACACTCT CAGAAAGAAA 300

CTGCGCAGaA GcATTAACAG AAAGGAAGCC TATGACCTTT AAACTTCTCA GCCAAGAAGA 360

ATTCATCCAG CATACCTCAG CTAGATCCCA ACGCTCTTTT ATGCAGACCG TAGAAATGGC 420

AGAGCTGCTG AGCAAGCGTG GCTTCAGTAC CCAGTATGTC GGCTACACTG ACCCACAAGG 480

GAAGGTAGTG GTGTCAGCTG TCCTCTACAG CATGCCTATG ACTGGTGGCC TTC 533



(2) INFORMATION FOR SEQ ID NO: 334: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 544 base pairs
(B) TYPE: nucleic acid

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 334: 

CCAGCAAACT AGGAAGCTAG CCGTAGTTGC TCAAAGCACA GCTTTGAGGT TGTAGATAAG 60 

ACTGACGAAG TCATGTACAA AACACTGTTT TGAGGTTGCA GATAGAACTG ACGAAGTCAC 120 

TCAAAACACT GTTTTGAGGT TGCAGATAGA ACTGACGAAG TCACTCAAAA CACTGTTTTG 180 

AGGTTGCAGA TAGAACTGAC GAAGTCAnnA ACCACACCTA CGGCAAAGTG AATCTGAAGT 240 

GGTTTGAAGA GAGTACAACT TGTCTTTTAG AAAAGGAGCC TATAATGAAA GTCTTTCAGC 300 

ATGTAAATAT CGTGACTTGT GATCAAGATT TCCATGTTTA TCTTGATGGA ATCTTAGCAG 360 

TCAAGGATTC TCAAATCGTC TATGTCGGTC AAGATAAGCC AGCGTTTTTA GAGCAAGCTG 420 

AGCAGATTAT AGACTATCAG GGAGCTTGGA TTATGCCTGG TTTGGTCAAT TGTCACACCC 480 

ATTCTGCAAT GACAGGTCTG AGAGGGATCC GAGATGACAG CAATCTCCAT GAATGGCTCA 540 

ATGA 544 
(2) INFORMATION FOR SEQ ID NO: 335: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 349 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 335: 
CCAGGAACTC AAATGTAAGT AGGGGTTCCT TTTTTGTATA TTTTTCAAAT AACGCCTCTA 60

CACTATTTGT AGCAAATTCA CCAACTACAG TTGTATCTTA GTTAAAATAA GTTAGAATAT 120

GTAAGTGAGT ACCAGATATA CCAAGACATC GTCACCATCT AAGGTATATT CAAAATACAA 180 

AAGTTGACCA ACTAGATTTC TGAATATCCT TATATATCCA TTCTTAAAAT TGGTTTAAAT 240 

AGCGTAGTCT TTTAAACTAG TTTTGAGAAT CCAAAAAATC TTCCTACATA TGTAAGAAGA 300 

TTTTTTAGTT CAGAATGATT AGaTTTAGCT AATGGATACC TATCCTACC 349 



(2) INFORMATION FOR SEQ ID NO: 336: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1206 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 336:



CTCCGATAAC CACACCAGCA ATGGAAATAA TTCCATCGTT AGCATCAAGA ACACCCGCAC 60

GCAGGATATT TAAACGACCT GCAAAATTTG AATCAATTTC GTGATTTGTT TCTGACGCTA 120

AATTTCAAGT TCAAGTTAGC CATCAAGAAG TCTTCTCTGG GTGACTTGTA GTCCAAGCAT 180

TTTTTAGGAT AGTTGTTAAT CCACTTTTCG ATGAATGCGA CTTCTTTGGG AGTCATTTTC 240

TTGGTTCCCT TAGGTAACCA TCTACGAATG AGCCTGTTGT GATTCTCATT AGTTCCCGGG 300

ATCCTCTAGA GTCGACCTGC AGGCATGCAA GCTTGGCACT GGCCGTCGTT TTACAACGTC 360

GATGACTGGG GAAAACCCTG GCGTTACCCA ACTTAATCGC CTTGCAGCAC ATCCCCCTTT 420

CGCCAGCTGG CGTAATAGCG AAGAGGCCCG CACCGATCGC CCTTCCCAAC AGTTGCGCAG 480

CCTGAATGGC GAATGGGGCC TGATGCGGTA TTTTCTCCTT ACGCATCTGT GCGGTATTTC 540

ACACCGCATA TGGTGCACTC TCAGTACAAT CTGCTCTGAT GCCGCATAGT TAAGCCAGCC 600

CCGACACCCG CCAACACCCG CTGACGCGCC CTGACGGGCT TGTCTGCTCC CGGCATCCGC 660

TTACAGACAA GCTGTGACCG TCTCCGGGAG CTGCATGTGT CAGAAGTTTT CACCGTCATC 720

ACCGAAACGC GCGAAACGAA AGGGCCTCGT GATACGCCTA TTTTTATAGG TTAATGTCAT 780

GATAAGGATG GTTTCTTAGA CGTCAAGTGG CACTTATCGG GGAAATGTGC GCCGAGACCC 840

TATTTGTTTA TTTGTCTAAA TACATTCAAA TATGTATCCG CTCGTGAGAA AATAAACCTG 900

ATAAATGCGT CAATAATATT GAAAAATGAA GAGTATGAGT ATTCTACATT TCCGTGTCGC 960

CCTTATACCC TTTTTTGCGG CATGTTGCCT TCCTGTTTTT GCTCACCCAG AAAACGCTGG 1020

TGAAAGTTTA AGATGCTGAA AAATCATTTG GGTGCACAAC TGGGGTTACA TCCAACTGGA 1080

ATCTCCAnCA GCAGTTAAGA TCCTCTGACA GTTGTACACG CCGCAAGAAC TATTCCCGAT 1140

GAATGAGCAA CTTTTAAAAG TCCTGCGAAT GTTGGGGCGG TAATAATCCC CGTGTTGTAG 1200

GCCCGG 1206
(2) INFORMATION FOR SEQ ID NO: 337: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 813 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 337: 

CTGCTCAACT CAGACAGTCA AATTTCTGAC TTTACCAAAA GAACCATCAA AAAAGTTGCT 60 

GAAAAAGGCC ATCAGGTTAT TATTACGACA GGTCGCCCTT ACCGTATGTC AAAAGATTTT 120 

TACCGTGAAC TGGGCTTAGA CACTCCTATG ATTAACTTCA ACGGATCCCT TACTCATTTA 180 

CCAGACCAAG TTTGGGATTT TGAAAAGTGT TTGACTGTAG ACAAAAAATA TCTGCTAGAT 240 

ATGGTTCAAC GTTCAGAGGA CATTCAAGCC GATTTTATCG CTGGAGAATA TCGTAAAAAA 300 

TTCTACATTA CAAATCCCAA TGAAGAAATT GCCAATCCCA AACTATTTGG TGTAGAAGCT 360 

TTCCAGCCTG AAGATCAATT CCAGCCTGAA TTGGTGACCA AGGACCCTAA CTGTATCCTC 420 

TTGCAGACTA GAGCCAGTGA CAAATATTCC TTGGCAAAAG AAATGAACGC CTTCTACCAG 480 

CATCAACTTT CTATCAATAC CTGGGGAGGT CCGCTCAATA TCCTTGAATG TACCCCAAAA 540 

GGTGTCAACA AGGCCTTTGC TTTGGACTAC TTGCTCAAGA TAATGAATCG TGACAAAAAA 600 

GATTTGATTG CCTTTGGAGA TGAACACAAT GATACCGAAA TGCTCGCTTT TGCTGGGAAG 660 

GGTTATGCCA TGAAAAATGC CAATCCAGAG CTACTCCCTT ATGCAGATGA GCAAATTTCC 720 

CTTACCAACG ACCAAGATGG GGTTGCCAAA ACCCTACAAG ACTTATTCTT ATAACCTATA 780 

CTGATACTCA ATGAGGGGCA AAGAGCGAAC TTA 813 



(2) INFORMATION FOR SEQ ID NO: 338: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 683 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 338: 
CCTAGATAAA TGATATAATT CTATTATTGT TCGTAAAAAT TAAAAGGAGA TTGATGATGG 60 
ACAAATTATT TAAACTAAAA GAGAACGGTA CAGACGTTCG TACAGAGGTT CTCGCTGGTT 120

TAACAACTTT CTTTGCAATG AGCTATATTC TCTTTGTAAA CCCACAAATA CTTTCACAAA 180 

CAGGAATGCC TGCTCAGGGC GTCTTCCTAG CGACGATTAT TGGTGCAGTA GCGGGTACCT 240 

TGATGATGGC TTTTTATGCT AACTTACCTT ATGCCCAAGC GCCAGGTATG GGACTCAATG 300 

CCTTCTTTAC CTTTACAGTT GTATTCGGGC TTGGTTATTC TTGGCAAGAA GCCCTAGCTA 360 

TGGTCTTCAT CTGTGGGATT ATTTCATTGA TTATTACCTT GACAAATGTT CGTAAAATGA 420 

TCATTGAATC GATTCCCAAT GCTCTTCGCT CAGCTATTTC AGCTGGTATC GGTGTCTTCC 480 

TTGCCTATGT AGGGATTAAG AATGCTGGAC TTTTGAAATT CACGATTGAT CCAGGCAACT 540 

ATACTGTTGT AGGAGAAGGG GCTGACAAAG CTCAAGCAAC GATTGCAGCA AACTCTTCAG 600 

CAGTTCCAGG ATTGGTCAGC TTTAATAATC CAGCTGTTTT AGTGGCTCTT GCAGGACTTG 660 

CCATTACTAT CTTCTTTGTC ATC 683 
(2) INFORMATION FOR SEQ ID NO: 339: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 852 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 339:



CTACTTTACA TGGAAGTAGT CACTGAATTC CAGTTAGAAA TTACTTTGTA ACTACGTTTT 60

GAGGAGGAGT AAAATGCTTT CCTACGTTCG ATATTACCCA CTAGCGATAG CTAAATTAAT 120

GTGTCTGTGC TCTCCTAAAA TCTGCTGATT TATTACTGAC TAATACAGGA GGTTTTTTTT 180

ATGgACAGAC AATCATATCT GCTATTGGTG TTTATATTTC CACCAGTATC GATTATTTAA 240

TTATTTTAAT TATTTTATTT GCACAGCTAT CACAGAATAA ACAGAAATGG CATATTTATG 300

CGGGGCAATA TCTAGGCACA GGCTTACTTG TAGGGGCGAG TTTAGTTGCT GCTTATGTCG 360

TTAATTTCGT GCCTGAAGAA TGGATGGTTG GATTGCTTGG TTTAATCCCT ATCTATTTAG 420

GGATTCGCTT TGCAATTGTT GGAGAAGATG CGGAAGAAGA AGAGGAAGAA ATTATTGAAA 480

GATTAGAACA AAGCAAGGCA AATCAACTGT TTTGGACAGT TACATTGCTG ACAATTGCGT 540

CTGGCGGAGA TAATTTAGGT ATCTATATAC CTTATTTTGC TTCGTTAGAT TGGTCACAGA 600

CCCTCGTGGC CTTGCTTGTG TTTGTAATCG GCATAATTAT CTTTTGCGAG ATTAGTCGGG 660

TGTTATCCTC TATTCCGTTA ATATTCGAGA CAATTGAAAA ATACGAGCGA ATCATTGTGC 720

CCTTAGTATT CATTCTACTT GGACTATACA TCATGTATGA AAATGGCACG ATAGAGACTT 780

TTCTGATCGT GTAGATTTTT TTGTTTCACT AGGGATTTAG CCCGAGCTCA AATCAGCTCT 840

CTGATTTTCA GA 852
(2) INFORMATION FOR SEQ ID NO: 340: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 754 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 340: 



CCGCACAAAA GCGCATAGTA TCAAGATTCT ATAAAGCCTT GATACTATGC CTTTTTAATG 60

GATAAATAGT TAGTCTTTTT TAAAGACCGG ATCTTTCAAA CTCTGCATAC TGGCATTGAT 120

CACCGCGCCT AGGATAACAA TTTTAGCAAT CAAGATAAAC CAAAACATCA TAACAACAAG 180

AAGAACGGAA CCTAAAATTC GGACATCCAC CAAATGATGG ACATAGTAAT TGAGATAACT 240

AGAGAACAGA GTTAGTAAAC CTAAAATCAC TAAGAGAACA AAGGCACTGC CTGGTAGGGT 300

ATAGCTAATT TTCCTGTTAG ATAGATTGGG AAGAAAATAA TAAAGCATGA CCAAGATAGC 360

AAAGAGGAGG GCGTAAATCA GAGGACCTGC CAACCCTTGT AAAGCCTGAT AGATAATGCC 420

ATCTTTTGTC CAATAATGAG CAAGTAAAGC CAAAATCATC TGACCAAATA AGATCAAAAA 480

CAAGGCAAAC GCAAAGAGGA GCTGCAACCA AAACTGACTA GGAGACTTAG CATCTGATGG 540

GAAATAAGTC CACGACTCTT TTCGACGCCA TAAGCCTTGT TAAAAGCTTT TTGCAAGAAA 600

TTCATAGATT TTGAAAAACT CCATAACGCC GATAAAACAG AAAAACTCAA TAAACCTGTT 660

GAAGGTTGCG TCAAGACTTC TCTGGCTATT TTTTCCACAC CTTCATAGAG GCTTGGGGGG 720

CAGACGTCTT TCATAAAGCC CAAAAATTCT CCCA 754



(2) INFORMATION FOR SEQ ID NO: 341: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 707 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 341: 
GGGGATAACT CTAGGAGTAC CGCTATTACT CGACTTAATG AGTGCACAAG AAGTCAGGAT 60 
TTTTATGCAG GTTGGGCGCT TCATCAGACA GGGAAGATTT ACAGCGACTA TTATGGAAGT 120 
CAAGGTTTGC TTTATTATTT GCTGACTTAC GTGAGTCAGG GCGGATTTTT CTTTGCCATC 180

TTTGAGTGGT TAGCCTTGGT AGCAGGAGGA TTTTTCCTTT TTAGATCAGC GGACACCTTG 240 

ACAGAGCAAG GAGACCAAGC TGGACATCTG GTGACTATTT TTTACATGCT AGTTACAGGT 300 

CTTGCTTTTG GTGGAGGCTA TGCGACTCTT TTAGCGCTTC CTTTCTTATT CGCAGCCTTT 360 

AGTTTAGTTG CGGCTTACCT AAGCAATCCA AGCCATGATA AGGGATTTGT ACGGATTGGG 420 

CTAGCTTTGG CAGGCGGATT TTTCTTTGCT CCCTTATCAT CGCTCCTGTT TATTGCTGTA 480 

GTGAGTTTAG GCTTGTTGGT CTTTAACCTT GGGCATAGAC GCTTTGCGCA TGGGTTTTAT 540 

CAGTTTCTTG CAGTGGCTTT AGGTTTTTCA CTTGTCTTTT ATCCAACTGC CTACTATAGT 600 

GCTGCAACAG GAAGTTTTGG GGATGCGwTT AGTGGTATTC GTTATCCTAT TGACAGTATT 660 

CGCTTTGATT TTACTTCTAA AATTTTAGAG AATATGTTTT TTTAAGG 707 
(2) INFORMATION FOR SEQ ID NO: 342: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 762 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 342: 

GGATTTTGAA AAACCATACC GATTTGACGA CGTATATTCC AAACATTTTC CTCAGTCAAA 60 

CGTTGGCCAT CAATTACAAT CTCTCCGGAT TCTGCTTCCA GTAAGCCATC AATTAATCGA 120 

ACCGTCGTTG ATTTACCACT ACCATTATGC CCTACAATCG AAAGCCATTC TCCACGTTTC 180 

ACGTGAAAgT AATATCCTTC ACATCGTAGT AGTTCTGATT TTCTTTATAG CGAAAAGAAA 240 

GATTTTTTAC ATCAATTATT GATTTCATTT CGAACCAAAT GTCCCTTTAA ATACATAGGC 300 

ACTACCCTTG AAATAGTCAT AGCCAGAGTA GATAGTGAAA AATAAGGCTA CATAAAGTAG 360 

AACTTGACCA AGCAAAGTCC AATGTAATAG CAAGAAAATA ATGGCAAACA TCTGACTAAA 420 

AGTTTTAATT TTTCCAGGCA TTGCTGCTGC TAAAATTGTT CCACCAGTTT CAACCAATAA 480 

AAGCCTTAAA CCTGTCACAG CTAACTCACG ACAGATAATC ACTGCAACAA TCCAAGCCGG 540 

AGCCATACCT AACTCAATCA ACATAATAAA AGCCGACATA ACTAGTAACT TATCCGCCAT 600 

AGGATCTGCA AATTTACCAA AATTACTGAC CACATTCCAT TTACGAGCTA AATATCCATC 660 

TAAATAGTCG GTAATACTGG CAACAGCAAA GATAATAGCT GCAACTATAT GACTCTCTAT 720 

CGAATTTCCT ATCGTTAAAA TAAAGATAAA AATAGGTATA AA 762 
(2) INFORMATION FOR SEQ ID NO: 343: 
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 482 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 343: 

CTTTTGATAC ACTTAAACTA TGAATACAAA TCTCAAGCCC AAACTTCAGC GTTTTGCTTC 60

TGCGACTGCC TTTGCCTGTC CTATCTGTCA AGAAAATCTG ACTCTGTTAG AGACTAATTT 120 

CAAGTGCTGC AACCGTCATT CTTTTGACTT GGCGAAATTT GGCTATGTCA ATCTAGTCCC 180 

TCAAATCAAG CAATCTGCTA ACTACGACAA GGAAAATTTT CAAAACCGTC AACAAATCCT 240 

AGAAGCCGGC TTTTACCAAG CTATCTTAGA TGCTGTATCT GACTTGCTTG CAAGCTCAAA 300 

AACTACCACA ACAATTTTGG ATATCGGTTG TGGTGAAGGA TTCTATTCTC GCAAACTACA 360 

AGAAAGTCAC TCTGAAAAAA CTTTCTATGC CTTTGACATC TCCAAAGATT CAGTCCAAAT 420 

CGCGGCTAAA AGTGAACCCA ACTGGGCAGT CAATTGGTTC GTTGGCGACT TGGCACGACT 480 
TC 482

(2) INFORMATION FOR SEQ ID NO: 344:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 520 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear


(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 344: 

TTTATTTTTA TAAAGTCAAT ACCTGTCTTT ACTTTTTCTT AAAAAAAGTT TATTATGTTC 60 

TTTAAGGAGG TGTAAAACAT GAAAATAAAT AATAAACTCG TTGGAGAACG TATTCAAAAT 120 

ATCCGTTTAA GCCATGGCGA CTCTATGGAA AAATTTGGAG AAAAATTTAA TACTAGCAAA 180 

GGTACAGTTA ACAACTGGGA AAAAGGTCGC AATTTACCAA ATAAAGAAAA CCTACTAAAA 240 

ATTGCATCTA TTGGAAAAAT GAGTGTTGAA GAGTTACTCT ACGGCGATTA CAATACTTAT 300 

CTACACTTAA AGATTATGGA TTTAGCTCCT GAATGTATAA AAAATTATGA TGAGTATAAC 360 

TCTTTACACG ATGATATAAC AAATAAAGCG TTACAGATCG CTCAAAATAC CATTTCTAAG 420 

ATTGATTATC AAATTTCAGA CGAAACGATC AAAAAATTTA TTGATTTAGC TATCGAACAA 480 

TCGAGAGATT TGCAAGGAAA TTTGTTGAAA AATAACGGGT 520




(2) INFORMATION FOR SEQ ID NO: 345: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1003 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 345:
GCATCAAATC CGCCATCAAA GAAGTTCTCT GGATTTACCA AGACCAGTCA AATAGCTTAG 60
AAGTGCTTAA TGACAAGTAC AATGTTCACT ACTGGAATGA CTGGGAAGTT GGAGACACGG 120
GAACCATTGG TGAGCGCTAT GGTGCCGTTG TTAAGAAACA CGACATTATC AATAAGCTTC 180
TCAAACAGTT GGAAACCAAT CCTTGGAACC GCCGCAATAT TATTTCGCTC TGGGATTACC 240
AAGCTTTCGA AGAAACAGAT GGGCTGCTCC CGTGCGCCTT TCAGACCATG TTTGATGTTC 300
GGCGTGTTGA TGGGGAAATC TATCTGGATG CGACCTTGAC CCAGCGCTCC AATGATATGC 360
TGGTGGCCCA CCACATCAAC GCTATGCAGT ATGTGGCTTT GCAGATGATG ATTGCCAAAC 420
ATTTTGGCTG GAAGGTTGGG AAGTTCTTCT ACTTCATCAA CAACCTCCAT ATCTATGATA 480
ATCAATTTGA ACAAGCTCAG GAATTGCTCC GTCGGGAgCC GTCAAACTGC CAACCACGCT 540
TGGTTTTAAA TGTTCCTGAT GGGACTAATT TCTTTGATAT CAAAGCAGAA GATTTTGAGT 600
TGGTGGATTA TGACCCTGTT AAGCCACAGT TGAAGTTTGA CCTAGCTATT TAAAAGAATA 660
GAAAAAAGAA GTTGAGAATA ATCCCAACTT CTTTTGTTTC TTAACGTGAT ACGCGGCGAC 720
GAGCTGCTTT TTTACGGTTT TCTTCGATGA AAGCTGCTTT TTGCTCTTCT GGTTCGATTA 780
CTTTCTTTTT AAATGCGTAT ACTGCACCTG CAACGGCAGC GACAGTTCCT GCGACACCTG 840
TTACAAGACC TTTAGCGAAT CCTTTAGCCA TGAGTCTTCC TCCTTTATAT TCTCAATCAG 900
CCAGCCTCCT CAAGAGGTCA CATTTTTCTG ACTGACCTTT TTGTGTTATA ATAATAGTAA 960
CGAAAAAATG GGAATTTTTC AAGGAAAAAA GATGAGAACA AAA 1003
(2) INFORMATION FOR SEQ ID NO: 346: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 750 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 346: 
CCGCACGTAC TATTCCAGAT GCCGAGGAAG TGGACCTCAT CCTCGTTGGC GCAACTGGTC 60

TCAACGCCTT TGAACGCCTC TTGGTCGGCT CTTCATCTGA ATACATACTC CGCCATGCTA 120

AGGTCGATTT GCTGGTTGTG AGAGAACAAG AAAAAACCTT ATAATCACAA AGAAAAGGAG 180 

CCCCTAGCTC CTTTTTGTTT ACGATTTATT TCTCTCTTTA TGGCGTTCGT AAGCCTTGAG 240 

CTGGCGCTGC AGTTCCTTTT TAATAGCAGG TTCTGGAGCA TATTTTTCTT CCCAATTATC 300 

TGGTTTTAAG ATTTTATGGG TCACTGGATC AAAATGAGCC TTGCCATCTG GAAAAATTTT 360 

CGCCATATTG GCCTGATGGA CAATATCAAA AATACGTTCT GGGTCCACCC CCATCAAGAC 420

AAAACTGCCG TAGGTGAAGT AAAGCGTGTC AATCAAGGCA TCCACTTGCC CTATCAAATC 480 

TTGCTGAGCA GGTGTCTTCT TGGCTACTTT ATCTGCTGCC TTATCAAGGG CCTGATGAAG 540 

TTGCGATACA GCTTGACCAA AATCTTCTTC AGAAGGACTG GCTGCTCGAA CAAACTCCAC 600 

CAATTCTTCT ATTTTAAAAC CAGCCCTATG GGTTGCACCC TCTAAATCCC AAGCTCGAGG 660 

TTCTTCTTGG GTTCGTTCAT CCATCATGTG GTGGAAAGTC TTGACCTTAT TGAAATGATA 720 

GTCACGGCTG ACAAAGACTT TTTCTGAAGA 750 
(2) INFORMATION FOR SEQ ID NO: 347: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 596 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 347: 

CGCAACATAC GGATAACCTC CAAAGAATAT TTTTATATTA TAGCAAAGCT TTAAATTGAA 60 

TGTTAGAGTC TTGTTCAAAA CAATCATCAA AACCACGTGG ATGATGGTAT TCTACTAAGT 120 

GTTGATCTTG AGGATAAGTG TACTTACCGC CAACTTCCCA GATAAATGGA TGGAAATCGT 180 

ATTGCAAGCG ATCTTTTCGC ATTTTCCAAA GTTCTAGAAT CTCATTAGTA GAAGCCATGA 240 

AGTTAGACCA GATATCATAG TGAACTGGGA TAATGACTTT GGTACGCAGA TTTTCTGCCA 300 

TACGAAGAAG GTCGATAGAT GTCAkTTTGT CTTGGATACC TACCGGATTT TCACCATAGT 360 

TATTCAAAGC AACATCAATT TTAAAGTCTT TACCATGTTT TGCAAAATAG TTTGAGAAGT 420 

GAGAATCTGC ACCATGATAG ATGGTTCCAC CTGGTGTTTC AAAGATATAG TTAACAGCCT 480 

TTTGAGCCAT TTCTTCATCT GTAACAGCCA AGCCAGCAgT TCACCGCCTG TCTCATCAGC 540 

ACCGTTCACT GGGAGAGTTA CCAAGCAAGT ACGGTCAAAT GATTCTACTG CATGAA 596 
(2) INFORMATION FOR SEQ ID NO: 348:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 673 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 348: 



CAGAGTCAAC AGCCTGAGTT GAAGGCAACT TTAGACACAG CAGTTACGAC AGCTGAATGA 60

GCTCCTCCAT CAGTTTTTTC TTTAATGAGT CCAGCTACAT CTTCAACTTC GAGGCCGTTA 120

ATCACAATGT CAGCGCCTAC TTCTTTTGCA AGGGCAAGTT TGTCATTGTT GATATCGACT 180

GCGATAACAT GAGCATTGAA TACTTTTTTA GCGTATTGAA CAGCGAGGTT ACCAAGTCCA 240

CCAGCACCGT AAAGAACAAC CCATTGGCCT GGTTCAACTT TTGCTTCTTT GATAGCTTTA 300

TAGGTTGTTA CTCCAGCACA TGTGATAGAA GAAGCTTGGG CTGGATCAAG TCCGTCAGGA 360

ACTTTGACAG CATAGTCAGC AGTTACGATA CATTGTTCAG CCATACCACC GTCTACTGAG 420

TAGCCAGCAT TTTTCACTGT ACGGCAAAGG GTTTCGCGAC CAGTTGTACA GTATTCGCAA 480

GTGCCACATC CTTCAAAGAA CCAAGCAACG CTGACGCGGT CACCGACTTT AAGGCTTTTC 540

ACATCTGGAG CAATCTCTTT AACGATACCG ATACCTTCGT GCCCAAGAAC ACGTCCTGGG 600

ACTTGACCAA AGTCACCATG AGCAACGTGG AGGTCGGTGT GGCAAACGCC CACAGTATTC 660

ACTTCTACAA GTG 673



(2) INFORMATION FOR SEQ ID NO: 349: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 198 base pairs 
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 349: 

GTACCCTACA AATGCTTTAC AGTATGGGTT GAGGGTGGTC AATGGAACTA TGGAGTAGGT 60 

TGGACAGGAA CTTTTGGATA TTCTGATTAC TTACATTCTA CTCGATATCA TACAGCAACT 120 

GTTAGACATG GGGGTAGAAC CTCTAAGGAT TATGCAAAAC CTGAGGCATG GGCTAGAGCT 180 

TCCCTCACCA AGATTCCG 198 
(2) INFORMATION FOR SEQ ID NO: 350: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 891 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 350:



GCTTCTTCTA TAGACAAAAA TATCATGGGT AAAATAATCA AGGCTATAGC TAGAAGGAGG 60

GACCAATCCA CTACTAATCC TAAGAACAAA ACACTCAAGA GAGCAGAAGA GAGAGGTTCA 120

CTGGCACTGA TAACGGCAAC CACCAAAGGA GAAACCAAGG ACACAGCCTT CATGGAAATG 180

AAAAAAGCAA AAGCCGTTCC AAAGAAAGCG ATAATGAGGC AAATCAAGAT ACTCCAAATA 240

TCAAGAGTAA AGGAAAGCTG ATAAACCGGC GAGAGGACAT TGCTAAACAA ACCTGCCAAA 300

ATCATCCCCC ACCCAACCGT AGGAACAAAA CCATAACGCT TAGCAAAAGG TTGGGGCAAG 360

ATAACATTAA ACATAACACC CATGGCACTC AGCAAACCTG TTATAAGAGC TAGCGGCGTC 420

ATGGATAACT GAGAGAGGTC TCCCTTTGTC GCCATCAAGC AAACACCCAG CATGGCAACC 480

AAAACATAGA AAACAGCGCT TTTTGACGCT CGTTTTTGAT AAACCAAGCG ATTGTAAAAG 540

AGGATAAAGA CAGGGCTAAT AAACTGTAAA ATAGTTGCTG TCGTAGCATT TGAGTATTCT 600

ACACAGAGAT AGAAAAAATA CTGAACTGAA AAAATCCCCA AAATAGCATA GGCTAAAAAG 660

GGCAGGTAAT TTTTCTTGTC TCGCCAAATA TCTAGCACTT GCGATTTTAA TTGTATTGCA 720

GACCAAATGA GTACAAGACT CCCTGCCAGT GTCAAACGCA TAGAGGTAAT CCAGCCCGAA 780

GACACCTGAT AATGAGTAAA GAAGTACTCT CCTAAAATTC CACAGATTCC CCATATTAAG 840

CCGGATAGGA GCGAATAAAT TTTTCCGTTA ACAATCTTTT TCTGATACTG A 891



(2) INFORMATION FOR SEQ ID NO: 351: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 325 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 351: 

GAAAGCGTTC AATAGAACAT TGCTTTTTTA TTTTTAGAGT AAGCTAAGCG CTTCAGCATC 60 

TGCGATGATG GTTACATCAG GGTGATTTTG GAGGCTACTT GCAGGTAGGT TCTCAGTCAC 120 

TGGGCCAGAT ACTGTTCCGG CAATGGCTTC TGCTTTCGAC TCACCGTAAG CAAAAAGAAT 180 

AATAGACTTG GCATCCAAAA TGTTTTTAAT CCCCATTGAA ATAGCTTGGG TTGGGACGTC 240 

TTCAATCTTG GCAAAGAAGC GTGCATTGGC TTCGATAGTA GACTGGTCAA GTTCTACTAG 300 
ATGCGTTTGA CTGTCAAATG GAGTG 325
(2) INFORMATION FOR SEQ ID NO: 352: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 344 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 352: 
CAAGAGCAGT TTGATGATTT TTGATAAGCA TGCGAATTTA AAATACAAAT ATGGCAATCG 60
CAAGTTTTGG TGTAGAGGCT ATTATGTAGA TACGGTAGGC CGTAATCAGA AAGTGATAGC 120
TGAATATATT CAGAATCAAT TACAAGAAGA CAGAGTAGCA GACCTAGCTC ACGTTATTCG 180
AGTCAGTAGA TCCGTTTACT GGCGAAATAA ATAAGAGGAA GTAACGTnAA GTGCTTTAGC 240
ACCTGCTCGG GAAAGTGGTG CGCGAGGAAG CTATTTCAGG ATGCTTTGGC CCTGGCCGGT 300
AGAAGCGTTA TAGCCGCAGA CTACGACACT TCACACTGGT GGTT 344
(2) INFORMATION FOR SEQ ID NO: 353: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 692 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 353: 
CCCTATCCCT GCTATTGGGG CTGCTCTCAT TGCTGCTTTG GCACAAATCA GTCTTCCAAT 60
TGGACCTGTT CCCTTCACTC TGCAAAACTT TGCAATCGGC TTGATTCTAC TGTCTTTAGA 120
CCGAGAGAGG CTGTACTTTC TGCTGGACTC TATCTTCTTC TAGGTGCTAT CGGTCTTCCT 180
GTCTTTGCAG GAGGTGGAGC TGGTTTTCAG GCTTTAGTTG GCCCTACTGC AGGCTATCTT 240
TGGTTTTATC TCGTTTACTC TGGACTTACT TCCTCTCTAA CCAACAGCAA GAGTGGTGTT 300
GTTAAGATTT TTCTTGCAAA CCTCTTGGGT GATGCCCTTG TCTTTGTCGG CGGGATTCTC 360
AGCTTGCATT TCCTAGCTGG AATGGCATTT GAAAAAGCTC TTGCTGTGGG GGTTCTTCCC 420
TTTATCATTC CAGACCTTGG CAAACTTCTA GCTATTAGTT TTATTAGCCG TCCCCTACTT 480
CAACGCCTTA AAAATCAGGC TTACTTTACT AACTAAAAAA GGATATCGAG TTATCATGAC 540
TCAATATCCT TTTCTTTTAT TTTGAAAACT TATACTCAAT GAAAATCAAA GAGCAAACTA 600
GGAAGCTAGC CGCAGGCTnG CAAAACACTG TTTTGAGGTT GTGGATGAAA CTGACGAGTA 660
AnATCTCATA CATACGGCAA GGCAAAGCTG AC 692
(2) INFORMATION FOR SEQ ID NO: 354: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1005 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 354: 

GTGATGGACT ACTGGTTCAA AACGCATCCA GAAGATTTTT TCGATAATGT CGGACCTCTT 60 

GTAGCCAGTA ACTTTTTTCA TACTTACACC GAAGATTTCC ACTTGATGAA GGAAATTGGA 120 

GTTAATTCTT TCCGCACTTC CATCCAATGG AGTCGACTCA TCAAGAATTT AGAGACAGGT 180 

GAGCCTGATC CAAAAGGTAT TGCTTTCTAC AATGCCATCA TTGAAGAAGC TAAAAAGAAC 240 

CAGATGGATC TTGTGATGAA TTTACATCAT TTTGATTTAC CAGTGGAACT TCTTCAAAAA 300 

TACGGTGGTT GGGAAAGCAA ACATGTAGTG GAGTTATTCG TGAAGTTTGC CAAGACTGCT 360 

TTCACATGCT TTGGAGATAA GGTTCATTAC TGGACAACTT TCAATGAGCC AATGGTCATT 420 

CCAGAAGCAG GGTACTTATA TGCTTTCCAT TATCCAAATC TAAAAGGAAA GGGAAAAGAG 480 

GCCGTACAAG TCATCTATAA TCTAAACCTT GCTAGTGCAA AAGTGATTCA ACTATATCGC 540 

TCATTAGAAC TTGATGGAAA GATTGGGATT ATTTTAAACT TGACACCTGC TTATCCAAGA 600 

AGTAATTCTC CAGAAGACTT AGAAGCAAGT CGATTTACAG ATGACTTCTT TAACAAAGTC 660 

TTCTTGAATC CAGCTGTTAA AGGAACTTTC CCAGAAAGAT TGGTAAAACA GCTAGAGAGA 720 

GATGGCGTGT TATGGAGTCA TACCGAAAAA GAGCTTCAAC TGATGAAATC AAATACGGTT 780 

GATTTTCTTG GAGTAAACTA CTACCATCCA AAACGTGTTC AAGCACAAGC AAATCCTGAG 840 

GAATATCAGA CGCCCTGGAT GCCAGACCAA TACTTCAAAG AGTATGAATG GCTGGAGCGT 900 

CGCATGAATC CATATCGTGG TTGGGAAATT TTTCCGAAAG CCATTTATGA TATTGCTATG 960 

ATTGTGAAGG AAGAATATGG TAATATCCCA TGGTTTATCA GTGAA 1005 



(2) INFORMATION FOR SEQ ID NO: 355: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 973 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 355:

CCGACAAGCA ATATTAAAAA GAGTAAACTA TTAACTAGTT AATTAACCGG TTTATTACTT 60 

TATAGTGAAT CAAATATACT TAAGAAAAGA GGAAAGAATG AAAATTAATA AAAAATATCT 120 

AGCAGGTTCA GTGGCAGTCC TTGCCCTAAG TGTTTGTTCC TATGAGCTTG GACGTTACCA 180 

AGCTGGTCAG GATAAGAAAG AGTCTAATCG AGTTGCTTAT ATAGATGGTG ATCAGGCTGG 240 

TCAAAAGGCA GAAAACTTGA CACCAGATGA AGTCAGTAAG AGGGAGGGGA TCAACGCCGA 300 

ACAAATTGTT ATCAAGATTA CGGATCAAGG TTATGTGACC TCTCATGGAG ACCATTATCA 360 

TTACTATAAT GGCAAGGTTC CTTATGATGC CATCATCAGT GAAGAGCTCC TCATGAAAGA 420 

TCCGAATTAT CAGTTGAAGG ATTCAGACAT TGTCAATGAA ATCAAGGGTG GTTATGTCAT 480 

TAAGGTAAAC GGTAAATACT ATGTTTACCT TAAGGATGCA GCTCATGCGG ATAATATTCG 540 

GACAAAAGAA GAGATTAAAC GTCAGAAGCA GGAACGCAGT CATAATCATA ACTCAAGAGC 600 

AGATAATGCT GTTGCTGCAG CCAGAGCCCA AGGACGTTAT ACAACGGATG ATGGGTATAT 660 

CTTCAATGCA TCTGATATCA TTGAGGACAC GGGTGATGCT TATATCGTTC CTCACGGCGA 720 

CCATTACCAT TACATTCCTA AGAATGAGTT ATCAGCTAGC GAGTTAGCTG CTGCAGAAGC 780

CTATTGGAAT GGGAAGCAGG GATCTCGTCC TTCTTCAAGT TCTAGTTATA ATGCAAATCC 840 

AGCTCAACCA AGATTGTCAG AGAACCACAA TCTGACTGTC ACTCCAACTT ATCATCAAAA 900 

TCAAGGGGGA AACATTTCAA GCCTTTTACG TGAATTGTAT GCTAACCCTT ATCAGAACGC 960 

CATGTGGGAT CTG 973



(2) INFORMATION FOR SEQ ID NO: 356: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 843 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 356: 

GGTCGCATCT GCAATATCTG TCGCCTCCAC ATAAGCGACA CCAGCCTTGT CTGCTGCCCG 60 

TTTGACACGT TCTGCAGATT GACCCAGGAT GACCATCTTC TTGAGTCCAG TAATGTCTGG 120 

CACCAATTCG TCAAACTCAT TGCCACGGTC CAAACCACCT GCAATCAAGA CGACCTTGCT 180 

GTTGTCAAAT CCTGACAAGC TTTTTGAGTA GCCAAGATAT TAGTTGATTT ACTGTCGTTA 240 

TAGAATTTAA CACsCTTGAT GTCATCCACA AACTGGAGAC GGTGTTTGAC ACCACCGAAG 300 

GCTGAAAGAG TTTCCTTGAT GGTTTGATTG TCCACATCAC GAAGCTTGGC TACAGCAATA 360 

GTCGCAAGGG CATTTTCCAC ATTGTGGCTA CCTGGAACAC CGATTTCATT CGCTGCCATG 420

ACTACTTCAC CACGGAAGTA GAGTTGACCA TCTTCCAGAT AAGCTCCATC AACCTTTTCA 480 

AGTGTTGAAA ATGGTACAAC AGTGGCTTCT GTCTTGGAAG TCAAGTCTTT TGCCAAGTCT 540 

TGATTAAAGT TCAAGACAAG GAAATCAGCT GCTGTCATCT TGTTCTGGAT ATTCCACTTG 600 

GCTGCTACAT ATTCCGAAAA TGACCCATGG TAGTCGATAT GAGTTGGCAT GAGGTTGGTA 660 

ATAACCGCAA TCTCTGGATG GAATTCTTGA ACACCCATGA GTTGGAAAGA AGAAAGTTCC 720 

ATAACAAGCG TGTCCTTATC TGATGCTATT TGAGCAACCT GACTAGCTGG ATAGCCGATA 780 

TTCCCTGATA AAAGACCATG TTGGCCAGCA GCAGTCAAAA CTTCCCGGGn TCCTCTAGAG 840 

TCG 843 



(2) INFORMATION FOR SEQ ID NO: 357: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 807 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 357: 



TTTTTTTTAT ATTTTTTTTA TTTATTATTT TTTGGCAAAA AAGACCAATT TGCTTTGGAG 60

CATTGCTTCT GCATTAAATT GTCTATTTTT GCTCGTGCTG TTACGCTCTT TGTATCATGT 120

ATTAACTAGC AAGTGCAACT TGCAAACTAC TAGTAAGAGG AGAAAAACAA AATGGTTATG 180

ACTGACCCAA TCGCAGACTT CCTAACTCGT ATTCGTAATG CTAACCAAGC TAAACACGAA 240

GTACTTGAAG TACCTGCATC AAACATCAAA AAAGGGATTG CTGAAATCCT TAAACGCGAA 300

GGTTTTGTAA AAAACGTTGA AATCATTGAA GATGACAAAC AAGGCGTCAT CCGTGTATTT 360

CTTAAATACG GACCAAATGG TGAGAAAGTT ATCACTAACT TGAAACGTGT TTCTAAACCA 420

GGACTTCGTG TCTACAAAAA ACGTGAAGAC CTTCCAAAAG TTCTTAACGG ACTTGGAATT 480

GCCATCCTTT CAACTTCTGA AGGTTTGCTT ACTGATAAAG AAGCACGCCA AAAGAATGTT 540

GGTGGTGAGG TTATCGCTTA CGTTTGGTAA AATCAAGATA CAAAGCTCGT AAAGAACAAA 600

GCAAAATTAG GAAGTTGGAG AAGTTTGTTT ACAAACAGGC CAACTTATCT ATTTTGCACA 660

GTTCTTAGAG CGTGTTCAGT TCAGCTCTTG AGCTAAGTAA GTATCTGAAC CCCGTGAAAA 720

CTGGCCGTGC TGGCATGTTC GGGTAACAGG AGAnAATAAA CATGTCACGT ATTGGTAATA 780

AGTTCAGCTA AGGCCTTCGT AAAAGTT 807

(2) INFORMATION FOR SEQ ID NO: 358:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 653 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 358: 

CCCAGTATTT TTGTCCAAGC ACGACCAGAA AAGGATGATA CAGATCTGGA ATTGGCTCTC 60 

TTAACCATCT tTGAACAAAA TCCTCAGGCT CAGGTCACTA TTTTCGGTGC CTTGGGTGGC 120 

CGTATTGACC ATATGTTGGC CAATGTCTTT CTGCCTAGCA ATCCTAAGTT GGCACCCTAT 180 

ATGCATCAAA TAGAAATTGA GGATGGGCAA AACTTGATTA CTTATTGTCC AGAAGGAATC 240

AGTCAGCTAG AACCTCGTTC AGACTACGAC TATCTAGCCT TTATGCCAGT TCGGGATAGC 300 

CAAGTATGAG TTGACAGAGG AAAATTTTTT CTTTAAAAAA GTGTACGCTT CTAACGAATA 360 

TATAGATAGG GAAGTGTCGG TAACTTGCCC AGATGGTTAT GTGGTCGTAC TGCATAGCAA 420 

GGACAGGAGG TAGGATGGAA AGTTTACTTA TTCTATTATT AATTGCCAAT CTAGCTGGTC 480 

TCTTTCTGAT TTGGCAAAGG CAGGATAGGC AGGAGAAACA CTTAAGTAAG AGCTTGGAGG 540 

ATCAGGCAGA TCATTTGTCA GACCAGCTGG ATTACCGCTT TGACCAAGCC AGACAAGCCA 600 

GCCAGTTAGA CCAAAAAGAT TTGGAAGTGG TTGTCAGCGA CCGTTTGCAA GAA 653 
(2) INFORMATION FOR SEQ ID NO: 359: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 641 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 359: 

CACCATGTGA TGTGACGCTG GCCACAGCTG TCAGAAATCT GGCGAGCCAT CGTGTGCAAT 60 

GACTCTTCCC GATGTAATCT TGTTCATAGT CCTTTGATGA ATATGTTCAA GCTGTAGAAG 120 

GTGCGCTTCC TGAACACTTA TCAACTGTTA CAGGCGAGTT GACCAGTCAG GAAACAGATG 180 

GCTGGTACAC ACTTGCCAAC ACTTCTTCAT CCCGCATTTA CCTAAAACAA GCCTTCCAAG 240 

AAAATAGCAA CCTCCTAGAG CAAGTGGTAG AACCCTTGAC TATTATCACT GGTGGACACA 300 

ACCACAAGGA CCAGTTGACC TATGCTTGGA AAACACTTTT GCAGAATGCG CCACATGATA 360 

GTATCTGTGG CTGTAGCGTG GACGAAGTTC ACCGCGAGAT GGAAACGCGT TTTGCCAAGG 420 

TCAACCAAGT AGGAAACTTT GTTAAAAGTA ACTTGCTCAA CGAGTGGAAG GGTAAAATTG 480

CTACGGATAA GGCTCAAAGT GACTATCTCT TTACTGTCAT TAACACAGGC TTGCATGATA 540 

AGGTCGATAC TGTCAGCACA GTGATTGATG TGGCGACTTG TGATTTCAAG GAATTGCACC 600 

CAACAGAAGG CTACAAAAAG ATGGCTGCTC TTATCTTGCC G 641 
(2) INFORMATION FOR SEQ ID NO: 360: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1958 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 360:



CCTCAAGGCC AATTTGAAGG CTCTAAAACA ATGGAAAAGT GCTACACAGA TGTGACAGAA 60

TTTGCCATTC CAGCAAGTAC TCAAAAGCTT TACTTATCAC CAGTTTTAGA TGGCTTTAAT 120

AGCGAAATTA TTGCTTTTAA TCTTTCGACT TCACCCAACT TAGAACAAGT ACAAACAATG 180

TTAGAACAGG CATTCAAAGA GAAGCACTAC GAGAATACGA TTCTCCATAG TGACCAAGGC 240

TGGCAATATC AACACGATTC TTATCATCGG TTCCTAGAGA GTAAGGGAAT TCAAGCATCT 300

ATGTCACGCA AGGGCAACAG CCAAGACAAC GGTATGATGG AATCTTTCTT TGGCATTTTA 360

AAATCCGAAA TGTTTTATGG CTATGAGAAA ACATTTAAAT CACTTAACCA ATTGGAACAA 420

GCCATTATAG ACTATATTGA TTACTACAAC AACAAACGAA TTAAGGTAAA ACTAAAAGGA 480

CTTAGTCCTG TGCAGTACAG AACTAAATCC TTTGGATAAA TTAATTGTCT AACTTTTTGG 540

GGTCAGTACA AAACTCTTGC TACTATGCGT TTTATTATTG AAAGACTTAT TGGACTTTCT 600

CTCAAATCGA GTTTTTACTC AATTTTCTTA CTTGATTGGG ATTGAAATTC CAATTAATTT 660

CTCTGAGTAG AGTGTCTTGA TATTGGCTTC ATCAACAGAG GCCTTATCAA TTTTACGTTT 720

CAAGAAAAAT TCTTGAATGG TTTCGATTTC AGGCTCACGA ATAGCACGGT GTTTGTTTGA 780

GATGAGGATT TCATAGTGAA GCGGAGCTTG GGTAAAAATA ACATCTGTAT TCCCTGCAGA 840

ATAAACCTCA ACAAGGGTTG CATCGGTACT TTCTAGCTGA CTTTTTACAA GTTGCGAGTG 900

TGAGTTTGTC GTATTGATAA GCTTCATAAT ATTTCCTCCG ATTTTCTAAT TCTATTATAG 960

CACTTTTTGA ATAAAGTCGC TTGATTTATA CTCAATGAAA ATCAAAGAGC AAACTAGGAA 1020

GCTAGCCGCA GGCTATACTT GAGTACGGTA AGGCGACGCT GACGTGGTTT GAATTTTATT 1080

TTCGAAGAGT ATTAGCCAAT CTTATGCTGT TTTTTCCAAG ATTCAATGGC CCATTTATGG 1140

CTACCACGTT TAAGGTTTTT GATAGCCTCG TCAATAGGGA ACCAGGCAAT ATGATTAAAG 1200

TTTTCTAGTG GCTTTTGTAC TTCTTTGAAA GGAGTTGCTT CATAGAGGTA GGCAGGATTG 1260

TAGTAGTAGG TATCACGATG ACGAGAATAG AAATATTCGT CAGCTTGTCC GTAATAGGTA 1320 

CCAATTTCTG CTGTGAAACC AAGCTCTTCA ATCAACTCAT GCTTTAGGGC TTCCTGATGA 1380 

TTTTCACCTG CTTCAATTTC TCCACATGGT AGGAACCAAG CACCATTTGG TTCTTGAACA 1440 

AGAACAATTT GTTTTTGTTC AGGATTAGGG ATAACTGCAT ATACGCCATA GCGAGCAATA 1500 

TAGTCTGTAT TCACTTTTTT TCTCCGAAAG TTGGGTTTGC CATTGCATTT TCCTCATTAT 1560 

CTAGTATCGT TATTATTATA GTGAAATGAA CCAAAAATAG TACACAATGT GGTATAATCT 1620 

TCTTATGGCA TATTCAATAG ATTTTCGTAA AAAAGTTCTC TCTTATTGTG AGCGAACAGG 1680 

TAGTATAACA GAAGCATCAC ACGTTTTCCA AATCTCACGT AATACCATTT ATGGCTGGTT 1740 

AAAGCTAAAA GAGAAAACAG GAGAGCTAAA CCACCAAGTA AAAGGAATAA AACCAAGAAA 1800 

GGTTGATAGA GATAGACTTA AAAACTATCT TACTGACAAT CCAGACGCTT ATTTGACTGA 1860 

AATAGCTTCT GAATTTGGCT GTCATCCAAC TACCATCCAC TATGCGCTCA AAGCTATGGG 1920 

tACACTCGAA AAAAAAAAGA ACTACACCTA CTATGAAC 1958 



(2) INFORMATION FOR SEQ ID NO: 361: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 851 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 361: 

TATGAAATTA AGTTATGATG ATAAAGTTCA GATCTATGAA CTTAGAAAAC AAGGATATAG 60 

CTTAGAGAAG CTTTCAAATA AATTTGGGAT AmACAATTCT AATCTTAGGT ATATGATTAA 120 

ATTGATTGAT CGTTACGGAA TAGAGTTCGT CAAAAAAGGA AAAAATCGTT ACTATTCTCC 180 

TGATTTAAAA CAAGAAATGA TTAATAAAGT CTGACATGAA GGCTGGACTA AAGATAGAGT 240 

TTCTCTTGAA TACGGTCTCC CAAGTCGTAC GATACTTCTT AACTGGCTAG CACAATACAG 300 

GAAAAACGGG TATACTATTG TTGAGAAACC AAGAGGGAGA GTACCTGAGA GCGGAGAATG 360

CCATCCTAAA AAAGTTAAGA GAACTCCGAT TGAAGGAGGA AAAAGAGAAA GAAGAAAGAC 420 

AGAAATTGTT TAAGAATTAA TGACTGAGTT TTCGTTAGAT CTTCTTTTAA AAGTCATTAA 480 

ACTAGCTCGT TCGACCTACT ACTATCACTT GAAACAGCTA GATAAACCAG ATAAGGACCA 540

AGAGCTTAAA GCTGAAATTC AATCCATTTT TATCGAACAC AAAGGAAATT ATGCTTATCG 600 

TCGGATTTAT TTAGAACTAA GAAATCGTGG TTATCTGGTA AATCATAAAA GAGTTCAAGG 660

CTTGATGAAA GTACTCAATt TACAAGCTAA AACGCGACAG AAACGAAAAT ATTCTTCTCA 720 

TAAAGGAGAC GTTGGCAAGA AGGCAGAGAA TCTCATTCAA GGCCAATTTG AAGGCTCTAA 780 

AACAATGGAA CAGTGCTACA CAGATGTGAC AGAATTTGCC ATTCCAGTAA GTACTTAAAA 840 



GCTTTACTTA T 851

(2) INFORMATION FOR SEQ ID NO: 362:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1168 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 362: 

GGGTAGAATC GATATCTCCA ATGAGTTGGT tTAGCTGGTG AAACTGTAAA AAGATTTCGw 60 

CCAATTCAAG GTTGAGGCAT CGCAAACTAT GGACTGTTTC CTCGTCAGTT CTGGAAAGAA 120 

AACGGGATAA GGTTGGCTGT GAAGCAAGCT GCCCTCCTTC CAACAATTTT GGAAAGTAGG 180 

CATCAGCTGA CAATTCTTTA CAAGCATAGT CCGTTCCATA ACCTGTTAAC AGTTGAAAGA 240 

GGAACTGGAC AAGGATATCT GAATCCGAAT AACGACAGTA GCGGCGTTGG TCATTCGTTA 300 

CTAAATACTT AGAAATCCGC TCTTTTAGTT TCAACTGGGA AAAAAGTTCC TGAAAAAAGA 360 

TAAGACCACC ATACTGGGTT AAATGACCTC CATCGAAAGA TAGTTGGTAA AAAGACTTGT 420 

TTTGGAAGTG ATGATTTGGT AAACTGTTCA TGTGAGTTTC CTTTCTTTTT GTGTTTTTTT 480 

CTACACTTAT ACCATAAAGG GGAAACTCTT TTTTGTCTAG TAAAAAACAC CCATTGGGTG 540 

AAAAAAGAAA CCATCCAGGA TCTAAGCTAA GGCAAGGATT CTGGATGGTT TTTAGATTTG 600 

GGGTGAATAA TTGGGGTTTT AGCTGCTTGC GGCCAATCAG GTTCAGATAC AAAAACTTAC 660 

TCATCAACCT TTAGTGGAAA TCCAACTACA TTTAACTATC TATTAGACTA TTACGCTGAT 720 

AATATAGTCA ATTGAAACAA GAACAAGACA AAAGAGCCTC ATAAAAGGTA TTGCAACTTG 780 

GTAATACCTT TTTGAGGTGC TTTTTGATAT GAGCCCATGT TTTCTCAATA GGATTGTACT 840 

CAGGTGAGTA GGGAGGAAGA GGTAAAAGTT TATACCCAAA CTCTTCACAC AAGAGTTCTA 900 

ACTTACCCAT TCTATGGAAT CTTGCATTAT CCATAATAAT AACCGATGGT GTGTTTAATG 960 

TTGGTAAGAG AAATTTCTGA AACCAAGCTT CAAAAAAGTC GCTCGTCATC GTCTCTTCGT 1020 

AAGTTATTGG AGCGATTAAC TCACCATTTG TTAGACCTGC AACCAAAGAA ATCCTCTGAT 1080 

ATCTTCTTCC AGATACTTTG CCTCTTCTTA ACTGACCTTT TAATGAGCGA CCATATTCTC 1140

GATAAAAATA AGTATCGAAT CCTGTTTC 1168

(2) INFORMATION FOR SEQ ID NO: 363:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 4483 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 363: 
GTCAGCTTCA GCAAGCCCAT CAGCTTCTGA ATCTGCATCA ACCAGTGCGT CCGCTTCAGC 60
GTCAACCAGT GCGTCGGCTT CAGCGTCGAC AAGTGCTTCG GCTTCAGCAT CAACGAGTGC 120
GTCGGCCTCA GCAAGCGCAA GTACCTCAGC GTCAGCTTCC GCCTCAACCA GTGCGTCGGC 180
TTCAGCAAGC ACAAGTGCGT CAGCCTCAGC AAGTATCTCA GCGTCTGAAT CGGCATCAAC 240
GAGTGCGTCT GAGTCAGCAT CAACGAGTAC GTCAGCCTCA GCAAGCACAT CAGCTTCTGA 300
ATCTGCATCA ACCAGTGCGT CAGCCTCAGC ATCGACAAGC GCCTCAGCTT CAGCAAGTAC 360
CAGTGCTTCA GCCTCAGCGT CGACAAGTGC GTCGGCCTCA ACCAGTGCAT CTGAATCGGC 420
ATCAACCAGT GCGTCAGCCT CAGCAAGTAC TAGTGCATCA GCTTCAGCAT CAACGAGTGC 480
ATCGGCTTCA GCATCAACCA GTGCCTCGGC TTCAGCGTCA ACCAGTGCGT CAGCTTCAGC 540
AAGTACCAGT GCTTCAGTCT CAGCATCAAC AAGTGCTTCA GCCTCAGCAT CGACAAGTGC 600
CTCGGCTTCA GCAAGCACAT CAGCATCTGA ATCAGCGTCG ACAAGCGCCT CAGCTTCAGC 660
AAGTACCAGT GCGTCAGCCT CAGCGTCGAC AAGTGCGTCA GCCTCAGCAA GTACTAGTGC 720
ATCAGCTTCA GCATCAACGA GTGCATCGGC TTCGGCGTCA ACCAGTGCAT CAGAGTCAGC 780
AAGTACCAGT GCGTCAGCTT CCGCATCAAC AAGTGCCTCG GCTTCAGCAA GCACCAGTGC 840
GTCGGCTTCA GCAAGTACTA GCGCCTCAGC CTCAGCCTCA ACCAGTGCGT CAGCCTCAGC 900
AAGTATCTCA GCGTCTGAAT CGGCATCAAC GAGTGCGTCC GCTTCAGCAA GTACTAGCGC 960
CTCAGCCTCA GCGTCAACAA GTGCATCGGC TTCAGCGTCA ACGAGTGCGT CTGAATCGGC 1020
ATCAACGAGT GCGTCCGCTT CAGCAAGTAC TAGCGCCTCA GCCTCAGCGT CAACAAGTGC 1080
ATCGGCTTCA GCATCAACGA GTGCGTCCGC TTCAGCAAGT ACTAGCGCCT CAGCCTCAGC 1140
GTCAACAAGT GCATCGGCTT CAGCGTCAAC GAGTGCGTCT GAGTCAGCAT CAACGAGTGC 1200
GTCAGCCTCA GCAAGCACAT CAGCTTCTGA ATCTGCATCA ACCAGTGCGT CAgCCTCAGC 1260
ATCGACAAGC GCCTCAGCTT CAGCAAGTAC CAGTGCGTCA GcTCAGCGTC GACAAGTGCs 1320
TCrGCTTCAG CAAGTACCAG TGCGTCAGCC TCAGCAAGTA CCAGTGCkTC AGCCTCAGCG 1380

TCGACAAGTG CGTCGGCCTC AACCAGTGCA TCTGAATCGG CATCAACCAG TGCGTCAGCC 1440 

TCAGCAAGTA CTAGCGCCTC AGCCTCAGCA TCAACGAGTG CGTCCGCTTC AGCAAGTACT 1500 

AGTGCATCAG CTTCAGCAAG TACTAGCGCC TCAGCCTCAG CGTCGACAAG CGCCTCAGCT 1560 

TCAGCAAGTA CCAGTGCGTC AGCCTCAGCG TCGACAAGTG CGTCGGCTTC AGCAAGTACC 1620 

TCAGCGTCTG AATCAGCATC AACAAGTGCG TCGGCTTCAG CATCAACGAG TGCATCAGCT 1680

TCAGCATCAA CAAGTGCTTC AGCTTCAGCA AGTACCAGTG CGTCGGCTTC AGCATCAACG 1740 

AGTGCTTCAG TCTCAGCGTC AACCAGTGCC TCTGAATCCG CATCAACAAG TGCCTCGGCT 1800 

TCAGCAAGCA CCAGTGCTTC GGCTTCAGCG TCAACGAGTG CGTCTGAGTC AGCATCAACG 1860 

AGTGCGTCAC CTCAGCAAGC ACATCAGCTT CTGAATCTGC ATCAACCAGT GCGTCACTTC 1920 

CGCATCAACA AGCGCCTCGG CCTCAGCAAG TACAAGTGCT TCAGCCTCAG CATCAACCAG 1980 

TGCATCAGCT TCAGCCTCAA CAAGTGCTTC AGCCTCAGCG TCAACGAGTG CCTCGGCTTC 2040 

AGCAAGTACC AGTGCGTCAG CTTCAGCAAG CACAAGTGCG TCAGCTTCAG CATCAACCAG 2100 

TGCTTCGGCT TCGGCATCAA CAAGTGCCTC AGCATCAGCA TCAACGAGTG CGTCAsCTCA 2160 

GCAAGTACTA GTGCATCAGC ATCAGCATCA ACCAGTGCAT CAGCCTCAGC AAGTATCTCA 2220 

GCGTCTGAAT CGGCATCAAC GAGTGCATCA GCATCAGCAT CAACGAGTGC ATCGGCTTCA 2280 

GCGTCAACCA GTGCATCAGT CTCAGCAAGC ACCAGTGCGT CGGCTTCAGC ATCAACCAGT 2340 

GCCTCAGCCT CAGCAAGTAT CTCAGCGTCT GAATCGGCAT CAACGAGTGC GTCAGcCTCA 2400 

GCAAGTACTA GTGCATCAGC ATCAGCATCA ACGAGTGCAT CGGCTTCAGC AAGTACCAGC 2460 

GCCTCAGCTT CAGCAAGCAC CAGTGCGTCA GCCTCAGCAA GTACCAGCGC CTCAGCCTCA 2520 

GCAAGCACCA GTGCCTCAGC TTCAGCAAGT ACCAGTGCGT CAGCCTCAGC GTCGACAAGT 2580 

GCGTCGGCTT CAGCAAGTAC CTCAGCGTCT GAATCAGCAT CAACGAGTGC ATCAGCTTCA 2640 

GCATCAACAA GTGCTTCAGC TTCAGCAAGT ACCAGTGCGT CGGCTTCAGC ATCAACGAGT 2700 

GCTTCAGTCT CAGCGTCAAC CAGTGCCTCT GAATCAGCAT CAACAAGTGC CTCGGCTTCA 2760

GCAAGCACCA GTGCGTCGGC TTCAGCAAGT ACTAGTGCAT CGGCTTCAGC ATCGACAAGT 2820 

GCGTCTGAAT CGGCATCAAC GAGTGCTTCG GCTTCAGCAT CAACGAGTGC GTCAGCCTCA 2880 

GCAAGCACAT CAGCTTCTGA ATCTGCATCA ACCAGTGCGT CCGCTTCAGC GTCAACCAGT 2940 

GCGTCGGCTT CAGCGTCGAC AAGTGCTTCG GCTTCAGCAT CAACGAGTGC GTCGGCCTCA 3000 

GCAAGCGCAA GTACCTCAGC GTCAGCTTCC GCCTCAACCA GTGCGTCCGC TTCAGCAAGC 3060 

ACAAGTGCGT CAGCCTCAGC AAGTATCTCA GCGTCTGAAT CGGCATCAAC GAGTGCGTCG 3120

GCCTCAGCAA GCGCAAGTAC CTCAGCGTCA GCTTCCGCCT CAACCAGTGC GTCGGCTTCA 3180 

GCAAGCACAA GTGCGTCAGC CTCAGCAAGT ATCTCAGCGT CTGAATCGGC ATCAACGAGT 3240 

GCGTCTGAGT CAGCATCAAC GAGTACGTCA GCCTCAGCAA GCACATCAGC TTCTGAATCG 3300 

GCATCAACCA GTGCGTCAGC CTCAGCATCG ACAAGCGCCT CAGCTTCAGC AAGTACCAGT 3360 

GCTTCAGCCT CAGCGTCGAC AAGTGCGTCG GCCTCAACCA GTGCATCTGA ATCGGCATCA 3420 

ACCAGTGCGT CAGCCTCAGC AAGTACTAGT GCATCAGCTT CAGCATCAAC GAGTGCATCG 3480 

GCTTCAGCAT CAACCAGTGC CTCGGCTTCA GCGTCAACCA GTGCGTCAGC TTCAGCAAGT 3540 

ACCAGTGCTT CAGTCTCAGC ATCAACAAGT GCTTCAGCCT CAGCATCGAC AAGTGCCTCG 3600 

GCTTCAGCAA GCACATCAGC ATCTGAATCA GCGTCGACAA GCGCCTCAGC TTCAGCAAGT 3660 

ACCAGTGCGT CAGCCTCAGC GTCGACAAGT GCGTCAGCCT CAGCAAGTAC TAGTGCATCA 3720 

GCTTCAGCAT CAACGAGTGC ATCGGCTTCG GCGTCAACCA GTGCATCAGA GTCAGCAAGT 3780 

ACCAGTGCGT CAGCTTCCGC ATCAACAAGT GCCTCGGCTT CAGCAAGCAC CAGTGCGTCG 3840 

GCTTCAGCAA GTACTAGCGC CTCAGCGTCA GCCTCAACCA GTGCGTCAGC CTCAGCAAGT 3900 

ATCTCAGCGT CTGAATCGGC ATCAACGAGT GCGTCCGCTT CAGCAAGTAC TAGCGCCTCA 3960 

GCCTCAGCGT CAACAAGTGC ATCGGCTTCA GCGTCAACGA GTGCGTCTGA ATCGGCATCA 4020 

ACGAGTGCGT CCGCTTCAGC AAGTACTAGC GCCTCAGCCT CAGCGTCAAC AAGTGCATCG 4080 

GCTTCAGCAT CAACGAGTGC GTCCGCTTCA GCAAGTACTA GCGCCTCAGC CTCAGCGTCA 4140 

ACAAGTGCAT CGGGTTCAGC GTCAACGAGT GCGTCTGAGT CAGCATCAAC GAGTGCGTCA 4200 

CCTCAkCAAG CACATCAGCT TCTGAATCTG CATCAACCAG TGCGTCACTT CCGCATCAAC 4260 

AAGCGCCTCG GCCTCAGCAA GTACAAGTGC TTCAGCCTCA GCATCAACCA GTGCATCAGC 4320 

TTCAGCCTCA ACAAGTGCTT CAGCCTCAGC GTCAGACCAG TGCCTCGGCT TCAGCAAGTA 4380 

CCAGTGCGTC ACTTCAGCAA GCACAAGTGC GTCAGCTTCA GCATCAACCA GTGCTTCGGC 4440 

TTCGGCATCA ACAAGTGCCT CAGCATCAGC ATCAACGAGT GCG 4483 
(2) INFORMATION FOR SEQ ID NO: 364: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2550 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 364: 

GTACCTCAGC GTCCTTCCGC CTCAACCAGT GCGTCCGCTT CAGCAAGCAC AAGTGCGTCA 60

CCTCAGCAAG TATCTCAGCG TCTGAATCGG CATCAACGAG TGCGTCGGCC TCAGCAAGCG 120 

CAAGTACCTC AGCGTCACTT CCGCCTCAAC CAGTGCGTCG GCTTCAGCAA GCACAAGTGC 180 

GTCAsCTCAG CAAGTATCTC AGCGTCTGAA TCGGCATCAA CGAGTGCGTC TGAGTCAGCA 240 

TCAACGAGTA CGTCAGCCTC AGCAAGCACA TCAGCTTCTG AATCGGCATC AACCAGTGCG 300 

TCAGCCTCAG CATCGACAAG CGCCTCAGCT TCAGCAAGTA CCAGTGCTTC AGCCTCAGCG 360

TCGACAAGTG CGTCGGCCTC AACCAGTGCA TCTGAATCGG CATCAACCAG TGCGTCAGCC 420 

TCAGCAAGTA CTAGTGCATC AGCTTCAGCA TCAACGAGTG CATCGGCTTC AGCATCAACC 480 

AGTGCCTCGG CTTCAGCGTC AACCAGTGCG TCAGCTTCAG CAAGTACCAG TGCTTCAGTC 540 

TCAGCATCAA CAAGTGCTTC AGCCTCAGCA TCGACAAGTG CCTCGGCTTC AGCAAGCACA 600 

TCAGCATCTG AATCAGCGTC GACAAGTGCG TCGGCCTCAA CCAGTGCATC TGAATCGGCA 660 

TCAACCAGTG CGTCAGCCTC AGCAAGTACT AGTGCATCAG CTTCAGCATC AACGAGTGCA 720 

TCGGCTTCGG CGTCAACCAG TGCATCAGAG TCAGCAAGTA CCAGTGCGTC AGCTTCCGCA 780 

TCAACAAGTG CCTCGGCTTC AGCAAGCACA TCAGCATCTG AATCAGCGTC AACCAGTGCT 840 

TCGGCTTCAG CAAGTACCAG TGCTTCAGCT TCAGCATCAA CCAGCGCCTC GGCCTCAGCA 900 

AGCACCTCAG CTTCTGAATC GGCCTCAACC AGCGCCTCGG CCTCAGCAAG CACCTCAGCT 960 

TCTGAATCGG CCTCAACCAG CGCCTCAGCC TCAGCATCAA CGAGTGCTTC GGCTTCAGCA 1020 

AGCACAAGCG CCTCGGGTTC AGCATCAACG AGTACGTCAG CTTCAGCGTC AACCAGTGCT 1080 

TCAGCCTCAG CATCAACAAG TGCGTCAGCC TCAGCAAGTA TCTCAGCGTC TGAATCGGCA 1140 

TCAACGAGTG CGTCTGAGTC AGCATCAACG AGTACGTCAG CCTCAGCAAG CACAAGTGCT 1200 

TCAGCCTCAG CAAGTATCTC AGCGTCTGAA TCGGCATCAA CGAGTGCGTC CGCTTCAGCA 1260

AGTACTAGCG CCTCAGCATC AGCGTCAACA AGTGCTTCGG CTTCAGCGTC AACGAGTGCG 1320 

TCTGAGTCAG CATCAACGAG TACGTCAGCC TCAGCAAGCA CATCAGCTTC TGAATCTGCA 1380 

TCAACCAGTG CGTCAGCCTC AGCATCGACA AGCGCCTCAG CTTCAGCAAG TACCAGTGCG 1440 

TCAGCCTCAG CAAGTACCAG TGCTTCAGCC TCAGCGTCGA CAAGTGCGTC GGCCTCAACC 1500 

AGTGCATCTG AATCGGCATC AACCAGTGCG TCAGCTCAGC AAGTACTAGT GCATCAGCTT 1560 

CAGCATCAAC GAGTGCATCG GCTTCGGCGT CAACCAGTGC ATCAGAGTCA GCAAGTACCA 1620 

GTGCGTCACt TCCGCATCAA CAAGTGCCTC GGCTTCAGCA AGCACATCAG CATCTGAATC 1680 

AGCGTCAACC AGTGCTTCGG CTTCAGCAAG TACCAGTGCT TCAGCTTCAG CATCAACCAG 1740 

CGCCTCGGCC TCAGCAAGCA CCTCAGCTTC TGAATCGGCC TCAACCAGCG CCTCGGCCTC 1800

AGCAAGCACC TCAGCTTCTG AATCGGCCTC AACCAGCGCC TCAGCCTCAG CATCAACGAG 1860 

TGCTTCGGCT TCAGCAAGCA CAAGCGCCTC GGGTTCAGCA TCAACGAGTA CGTCAGCTTC 1920 

AGCGTCAACC AGTGCTTCAG CCTCAGCATC AACAAGTGCG TCAGCCTCAG CAAGTATCTC 1980 

AGCGTCTGAA TCGGCATCAA CGAGTGCGTC TGAGTCAGCA TCAACGAGTA CGTCAGCCTC 2040 

AGCAAGCACC TCAGCTTCTG AATCGGCCTC AACCAGTGCG TCAGCCTCAG CATCGACAAG 2100 

CGCCTCAGCT TCAGCAAGTA CCAGTGCTTC AGCCTCAGCG TCGACAAGTG CGTCGGCCTC 2160 

AACCAGTGCA TCTGAATCGG CATCAACCAG TGCGTCAGCC TCAGCAAGTA CTAGTGCATC 2220 

GGCTTCAGCA TCAACCAGTG CCTCGGCTTC AGCGTCAACC AGTGCGTCAG CTTCAGCAAG 2280 

TACCAGTGCT TCAGTCTCAG CATCAACAAG TGCTTCAGCC TCAGCATCGA CAAGTGCCTC 2340 

GGCTTCAGCA AGCACATCAG CATCTGAATC AGCGTCGACA AGCGCCTCAG CTTCAGCAAG 2400 

TACCAGTGCG TCAGCCTCAG CGTCGACAAG TGCGTCAGCT ACAGCAAGTA CTAGTGCATC 2460 

AGCTTCAGCA TCAACGAGTG CATCGGCTTC GGCGTCAACC AGTGCATCAG AGTCAGCAAG 2520 

TACCAGTGCG TCAGTTCACG CATCAACAAG 2550 



(2) INFORMATION FOR SEQ ID NO: 365: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1436 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 365: 

ACCCAGCAAG TACTAGTGCA TCGGCTTCAG CAAGCACCAG TGCGTCGGCT TCAGCATCAA 60 

CCAGTGCCTC AGCCTCAGCA AGTATCTCAG CGTCTGAATC GGCATCAACG AGTGCGTCAC 120 

CTCAGCAAGT ACTAGTGCAT CAGCATCAGC ATCAACGAGT GCATCGGCTT CAGCAAGTAC 180 

CAGCGCCTCA GCTTCAGCAA GCACCAGTGC GTCAsCTCAG CAAGTACCAG CGCCTCAGCC 240 

TCAGCAAGCA CCAGTGCCTC AGCTTCAGCA AGTACCAGTG CGTCAGCCTC AGCGTCGACA 300 

AGTGCGTCGG CTTCAGCAAG TACCTCAGCG TCTGAATCAG CATCAACGAG TGCATCAGCT 360 

TCAGCATCAA CAAGTGCTTC AGCTTCAGCA AGTATCTCAG CGTCTGAATC GGCATCAACG 420 

AGTGCGTCCG CTTCAGCAAG TACTAGCGCC TCAGCATCAG CGTCAACAAG TGCTTCGGCT 480 

TCAGCGTCAA CGAGTGCGTC TGAGTCAGCA TCAACGAGTA CGTCAGCCTC AGCAAGCACA 540 

TCAGCTTCTG AATCTGCATC AACCAGTGCG TCAGCCTCAG CATCGACAAG CGCCTCAGCT 600 

TCAGCAAGTA CCAGTGCGTC AgCCTCAGCA AGTACCAGTG CTTCAGCCTC AGCGTCGACA 660

AGTGCGTCGG CCTCAACCAG TGCATCTGAA TCGGCATCAA CCAGTGCGTC AGCCTCAGCA 720 

AGTACTAGCG CCTCAGCCTC AGCATCAACG AGTGCGTCCG CTTCAGCAAG TACTAGTGCA 780 

TCAGCTTCAG CAAGTACTAG CGCCTCAGCC TCAGCGTCGA CAAGCGCCTC AGCTTCAGCA 840 

AGTACCAGTG CGTCAGCCTC AGCGTCGACA AGTGCGTCGG CTTCAGCAAG TACCTCAGCG 900 

TCTGAATCAG CATCAACAAG TGCGTCGGCT TCAGCATCAA CGAGTGCATC AGCTTCAGCA 960 

TCAACAAGTG CTTCAGCTTC AGCAAGTACC AGTGCGTCGG CTTCAGCATC AACGAGTGCT 1020 

TCAGTCTCAG CGTCAACCAG TGCCTCTGAA TCCGCATCAA CAAGTGCCTC GGCTTCAGCA 1080 

AGCACCAGTG CTTCGGCTTC AGCGTCAACG AGTGCGTCTG AGTCAGCATC AACGAGTGCG 1140 

TCAGCCTCAG CAAGCACATC AGCTTCTGAA TCTGCATCAA CCAGTGCGTC AGCTTCCGCA 1200 

TCAACAAGCG CCTCGGCCTC AGCAAGTACA AGTGCTTCAG CCTCAGCATC AACCAGTGCA 1260 

TCAGCTTCAG CCTCAACAAG TGCTTCAGCC TCAGCGTCAA CCAGTGCCTC GGCTTCAGCA 1320 

AGTACCAGTG CGTCAGCTTC AGCAAGCACA AGTGCGTCAG CTTCAGCATC AACCAGTGCT 1380 

TCGGCTTCGG CATCAACAAG TGCCTCAGCA TCAGCATCAA CGAGTGCGTC AGCCGG 1436 



(2) INFORMATION FOR SEQ ID NO: 366: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 735 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 366:

GCAGTTGCCA CACCGTGCTG ACCAGCACCC GTTCCTGCGA TAATTTTCTT TTTACCCATG 60 

CGTwTGGCAA GCCAAACTTG TCCTAAGGCA TTGTTAATCT TGTGGGCTCC TGTATGGTTA 120 

AGGTCTTCCC GTTTGAGATA AATCTTGCTC CGCCAATATG CTGGGTCAAG TTTTTTGCGT 180 

AATAAAGAGG AGTTTCACGT CCTACGTACT GGCGCAAAAG CTGGTTTAAT TCCTCTTGGA 240 

AACTTGGGTC TGCCTGACTT TCACGGTAGG CCTTCTCCAA CTCCAAAACT GCTGTCATCA 300 

ATGTTTCTGG GACAAAACGT CCGCCGAATT TTCCGTAAAA TCCATCTTTA TTTGGTTCCT 360 

GATATGCCAT GCTTTACCCT CTCTATAAAT CTTCTAATCT TTTCATGATC TTTTTGTCCA 420 

TCTGTCTCCA CTCCGCTCGA TACATCTACT GCATAGGGAG TAAAGTGTTG AATTGCTTTT 480 

ACTACATTAT CTTCATTAAG GCCACCTGCG ATAAAGAAGG GCTGTGCTAG TCCAGTCGTA 540 

TCCAGTTGAC CCCAATCAAA GGGCTGGCCA CTTCCTGCCA CAGGGGCATC AAAGAGTAGA 600

TAATCTGCCT GAGAATTGGG GACATGCCCA TTTCCATCTA CCTGCACAGC CTGAATACTG 660

GCACAAGGCA AATTCTCAAA TAAATCATCT GCCACCTGAC CGTGAACTTG AACCAAGTCC 720

AAGCCGGGGA TCCTC 735



(2) INFORMATION FOR SEQ ID NO: 367: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1702 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 367: 

TACTAGCGCC TCAGCCTCAG CGTCAACAAG TGCATCGGCT TCAGCATCAA CGAGTGCGTC 60 

CGCTTCAGCA AGTACTAGCG CCTCAGCCTC AGCGTCAACA AGTGCATCGG CTTCAGCGTC 120 

AACGAGTGCG TCTGAGTCAG CATCAACGAG TGCGTCAGCC TCAGCAAGCA CATCAGCTTC 180 

TGAATCTGCA TCAACCAGTG CGTCAGCCTC AGCATCGACA AGCGCCTCAG CTTCAGCAAG 240 

TACCAGTGCG TCAGCCTCAG CGTCGACAAG TGCGTCGGCT TCAGCAAGTA CCAGTGCGTC 300 

AGCCTCAGCA AGTACCAGTG CGTCAGCCTC AGCGTCGACA AGTGCGTCGG CCTCAACCAG 360 

TGCATCTGAA TCGGCATCAA CCAGTGCGTC AGCCTCAGCA AGTACTAGTG CATCAGCTTC 420 

AGCATCAACG AGTGCATCGG CTTCAGCATC AACCAGTGCA TCAGAGTCAG CAAGTACCAG 480 

TGCGTCAgCT TCCGCATCAA CAAGTGCCTC GGCTTCAGCA AGTACTAGCG CCTCAGCCTC 540 

AGCGTCAACA AGTGCTTCAG CTTCCGCGTC AACCAGCGCC TCGGCCTCAG CAAGTATCTC 600 

AGCGTCTGAA TCGGCATCAA CAAGTGCCTC GGCTTCAGCA TCAACGAGTG CATCAGTCTC 660 

AGCAAGCACC AGTGCGTCGG CCTCAGCAAG CACCAGCGCG TCTGAATCCG CATCAACCAG 720 

TGCCTCAGCT TCAGCAAGTA CCTCAGCATC TGAATCAGCA TCAACAAGTG CATCGGCT^C 780 

AGCAAGCACA AGTGCTTCAG CCTCAGCAAG TATCTCAGCG TCTGAATCGG CATCAACGAG 840 

TGCGTCCGCT TCAGCAAGTA CTAGCGCCTC AGCATCAGCG TCAACAAGTG CTTCGGCTTC 900 

AGCGTCAACG AGTGCGTCTG AGTCAGCATC AACGAGTACG TCAGCCTCAG CAAGCACATC 960 

AGCTTCTGAA TCTGCATCAA CCAGTGCGTC AGCCTCAGCA TCGACAAGCG CCTCAGCTTC 1020 

AGCAAGTACC AGTGCGTCAG CCTCAGCAAG TACCAGTGCT TCAGCCTCAG CGTCGACAAG 1080 

TGCGTCGGCC TCAACCAGTG CATCTGAATC GGCATCAACC AGTGCGTCAG CCTCAGCAAG 1140 

TACTAGCGCC TCAGCCTCAG CATCAACGAG TGCGTCCGCT TCAGCAAGTA CTAGTGCATC 1200 

AGCATCAGCA TCAACGAGTG CATCGGCTTC AGCAAGTACC AGCGCCTCAG CTTCAGCAAG 1260

CACCAGTGCG TCAGCCTCAG CAAGTACCAG CGCCTCAGCC TCAGCAAGCA CCAGTGCCTC 1320 

AGCTTCAGCA AGTACCAGTG CGTCAGCCTC AGCGTCGACA AGTGCGTCGG CTTCAGCAAG 1380 

TACCTCAGCG TCTGAATCAG CATCAACGAG TGCATCAGCT TCAGCATCAA CAAGTGCTTC 1440

AGCTTCAGCA AGTACCAGTG CGTCGGCTTC AGCATCAACG AGTGCTTCAG TCTCAGCGTC 1500 

AACCAGTGCC TCTGAATCAG CATCAACAAG TGCCTCGGCT TCAGCAAGCA CCAGTGCGTC 1560

GGCTTCAGCA AGTACTAGTG CATCGGCTTC AGCATCGACA AGTGCGTCTG AATCGGCATC 1620 

AACGAGTGCT TCGGCTTCAG CATCAACGAG TGCGTCAGCC TCAGCAAGCA CATCAGCTTC 1680 

TGAATCTGCA TCAACCAGTG CG 1702
(2) INFORMATION FOR SEQ ID NO: 368: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 941 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 368:



ACCAGTGCAT CAGCTTCAGC CTCAACAAGT GCTTCAGCCT CAGCGTCAAC CAGTGCCTCG 60
GCTTCAGCAA GTACCAGTGC GTCACTTCAG CAAGCACAAG TGCGTCACTT CAGCATCAAC 120
CAGTGCTTCG GCTTCGGCAT CAACAAGTGC CTCAGCATCA GCATCAACGA GTGCGTCACC 180
TCAGCAAGTA CTAGTGCATC AGCATCAGCA TCAACCAGTG CATCAGCCTC AGCAAGTATC 240
TCAGCGTCTG AATCGGCATC AACGAGTGCA TCAGCATCAG CATCAACGAG TGCATCGGCT 300
TCAGCGTCAA CCAGTGCATC AGTCTCAGCA AGCACCAGTG CGTCGGCTTC AGCATCAACG 360
AGTGCCTCAG CCTCAGCAAG TATCTCAGCG TCTGAATCGG CATCAACGAG TGCGTCAGCC 420
TCAGCAAGTA CTAGTGCATC GGCTTCAGCA AGCACCAGTG CGTCGGCTTC AGCATCAACC 480
AGTGCCTCAG CCTCAGCAAG TATCTCAGCG TCTGAATCGG CATCAACGAG TGCGTCAGCC 540
TCAGCAAGTA CTAGTGCATC AGCATCAGCA TCAACGAGTG CATCGGCTTC AGCAAGTACC 600
AGCGCCTCAG CTTCAGCAAG CACCAGTGCG TCAGCCTCAG CAAGTACCAG CGCCTCAGCC 660
TCAGCAAGCA CCAGTGCCTC AGCTTCAGCA AGTACCAGTG CGTCAGCCTC AGCGTCGACA 720
AGTGCGTCGG CTTCAGCAAG TACCTCAGCG TCTGAATCAG CATCAACGAG TGCATCAGCT 780
TCAGCATCAA CAAGTGCTTC AGCTTCAGCA AGTACCAGTG CGTCGGCTTC AGCATCAACG 840
AGTGCTTCAG TCTCAGCGTC AACCAGTGCC TCTGAATCAG CATCAACAAG TGCCTCGGCT 900
TCAGCAAGCA CCAGTGCGTC GGCTTCAGCA AGTACTAGTG C 941 
(2) INFORMATION FOR SEQ ID NO: 369: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 869 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 369:



CAGCAAGTAC TAGTGCATCA GCTTCAGCAT CAACGAGTGC ATCGGCTTCT GCGTCAACCA 60

GTGCATCAGA GTCAGCAAGT ACCAGTGCGT CAGCTTCCGC ATCAACAAGT GCCTCGGCTT 120

CAGCAAGCAC CAGTGCGTCG GCTTCAGCAA GTACTAGCGC CTCAGCCTCA GCCTCAACCA 180

GTGCGTCAGC CTCAGCAAGT ATCTCAGCGT CTGAATCGGC ATCAACGAGT GCGTCCGCTT 240

CAGCAAGTAC TAGCGCCTCA GCCTCAGCGT CAACAAGTGC ATCGGCTTCA GCGTCAACGA 300

GTGCGTCTGA ATCGGCATCA ACGAGTGCGT CGGCTTCAGC AAGTACTAGC GCCTCAGCCT 360

CAGCGTCAAC AAGTGCATCG GCTTCAGCAT CAACGAGTGC GTCCGCTTCA GCAAGTACTA 420

GCGCCTCAGC CTCAGCGTCA ACAAGTGCAT CGGCTTCAGC GTCAACGAGT GCGTCTGAGT 480

CAGCATCAAC GAGTGCGTCA GCCTCAGCAA GCACATCAGC TTCTGAATCT GCATCAACCA 540

GTGCGTCAGC CTCAGCATCG ACAAGCGCCT CAGCTTCAGC AAGTACCAGT GCGTCAGCCT 600

CAGCGTCGAC AAGTGCGTCG GCTTCAGCAA GTACCAGTGC GTCAGCCTCA GCAAGTACCA 660

GTGCGTCAGC CTCAGCGTCG ACAAGTGCGT CGGCCTCAAC CAGTGCATCT GAATCGGCAT 720

CAACCAGTGC GTCAGCCTCA GCAAGTACTA GTGCATCAGC TTCAGCATCA ACGAGTGCAT 780

CGGCTTCAGC ATCAACCAGT GCATCAGAGT CAGCAAGTAC CAGTGCGTCA GnTTCCGCAT 840

GCAACAAGTG CCTCGGCTTC AGCAAGTAC 869



(2) INFORMATION FOR SEQ ID NO: 370: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 750 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 370: 
TCAACAAGTG CCTCAGCATC AGCATCAACG AGTGCGTCAG CCTCAGCAAG TACTAGTGCA 60 

TCAGCATCAG CATCAACCAG TGCATCAGCC TCAGCAAGTA TCTCAGCGTC TGAATCGGCA 120

TCAACGAGTG CATCAGCATC AGCATCAACG AGTGCATCGG CTTCAGCGTC AACCAGTGCA 180 

TCAGTCTCAG CAAGCACCAG TGCGTCGGCT TCAGCATCAA CGAGTGCCTC AGCCTCAGCA 240 

AGTATCTCAG CGTCTGAATC GGCATCAACG AGTGCGTCAG CCTCAGCAAG TACTAGTGCA 300 

TCGGCTTCAG CAAGCACCAG TGCGTCGGCT TCAGCATCAA CCAGTGCCTC AGCCTCAGCA 360 

AGTATCTCAG CGTCTGAATC GGCATCAACG AGTGCGTCAG CCTCAGCAAG TACTAGTGCA 420 

TCAGCATCAG CATCAACGAG TGCATCGGCT TCAGCAAGTA CCAGCGCCTC AGCTTCAGCA 480 

AGCACCAGTG CGTCAGCCTC AGCAAGTACC AGCGCCTCAG CCTCAGCAAG CACCAGTGCC 540 

TCAGCTTCAG CAAGTACCAG TGCGTCAGCC TCAGCGTCGA CAAGTGCGTC GGCTTCAGCA 600 

AGTACCTCAG CGTCTGAATC AGCATCAACG AGTGCATCAG CTTCAGCATC AACAAGTGCT 660 

TCAGCTTCAG CAAGTATCTC AGCGTCTGAA TCGGCATCAA CGAGTGCGTC CGCTTCAGCA 720 

AGTACTAGCG CCTCAGCATC AGCGTCAACG 750 



(2) INFORMATION FOR SEQ ID NO: 371: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 957 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 371: 

CCGGAAAACA GCTCTGGCGC TTGGTCTTGC CCAGCGTATT GCTAGTGGTG ACGTGCCTGC 60 

GGAAATGGCT AAGATGCGCG TGTTAGAACT TGATTTGATG AATGTCGTTG CAGGGACACG 120 

CTTCCGTGGT GACTTTGAAG AACGCATGAA TAATATCATC AAGGATATTG AAGAAGATGG 180 

CCAAGTCATC CTCTTTATCG ATGAACTCCA CACCATCATG GGTTCTGGTA GCGGGATTGA 240 

TTCGACTCTG GATGCGGCCA ATATCTTGAA ACCAGCCTTG GCGCGTGGAA CTTTGAGAAC 300 

GGTTGGTGCC ACTACTCAGG AAGAATATCA AAAACATATC GAAAAAGATG CGGCACTTTC 360 

TCGTCGTTTC GCTAAAGTGA CGATTGAAGA ACCAAGTGTG GCAGATAGTA TGACTATTTT 420 

ACAAGGTTTG AAGGCGACTT ATGAGAAACA TCACCGTGTA CAAATCACAG ATGAAGCGGT 480 

TGAAACAGCG GTTAAGATGG CTCATCGTTA TTTAACCAGT CGTCACTTGC CAGACTCTGC 540 

TATCGATCTC TTGGATGAGG CGGCAGCAAC AGTGCAAAAT AAGGCAAAGC ATGTAAAAGC 600 

AGACGATTCA GATTTGAGTC CAGCTGACAA GGCCCTGATG GATGGCAAGT GGAAACAGGC 660 
AGCCCAGCTA ATCGCAAAAG AAGAGGAAGT ACCTGTCTAC AAAGACTTGG TGACAGAGTC 720

TGATATTTTG ACCACCTTGA GTCGCTTGTC AGGAATCCCA GTTCAAAAAC TGACTCAAAC 780 

GGATGCTAAG AAGTATTTAA ATCTTGAAGC AGAACTCCAT AAACGGGTTA TCGGTCAAGA 840 

TCAAGCTGTT TCAAGCATTA GCCGTGCCAT TCGCCGCAAC CAGTCAGGGA TTCGCAGTCA 900 

TAAGCGTCCG ATTGGTTCCT TTATGTTCCT AGGGCCTACA GGTGTCGGGG TATCCGA 957 



(2) INFORMATION FOR SEQ ID NO: 372: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 807 base pairs 

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 372: 



CAAAGCGCCT CAGCTTCAGC ATCAACAAGT GCGTCGGCTT CAGCATCAAC CAGTGCCTCG 60

GCTTCAGCGT CAACCAGTGC GTCACATTCA GCAAGTACCA GTGCTTCAGT CTCAGCATCA 120

ACAAGTGCTT CAGCCTCAGC ATCGACAAGT GCCTCGGCTT CAGCAAGCAC ATCAGCATCT 180

GAATCAGCGT CAACCAGTGC TTCGGCTTCA GCAAGTACCA GTGCTTCAGC TTCAGCATCA 240

ACCAGCGCCT CGGCCTCAGC AAGCACCTCA GCTTCTGAAT CGGCCTCAAC CAGCGCCTCG 300

GCCTCAGCAA GCACCTCAGC TTCTGAATCG GCCTCAACCA GCGCCTCAGC CTCAGCATCA 360

ACGAGTGCTT CGGCTTCAGC AAGCACAAGC GCCTCGGGTT CAGCATCAAC GAGTACGTCA 420

GCTTCAGCGT CAACCAGTGC TTCAGCCTCA GCATCAACAA GTGCGTCAGC CTCAGCAAGT 480

ATCTCAGCGT CTGAATCGGC ATCAACGAGT GCGTCTGAGT CAGCATCAAC GAGTACGTCA 540

GCCTCAGCAA GCACCTCAGC TTCTGAATCG GCCTCAACCA GTGCGTCAGC CTCAGCATCG 600

ACAAGCGCCT CAGCTTCAGC AAGTACCAGT GCTTCAGCCT CAGCGTCGAC AAGTGCGTCG 660

GCCTCAACCA GTGCATCTGA ATCGGCATCA ACCAGTGCGT CAGCCTCAGC AAGTACTAGT 720

GCATCGGCTT CAGCATCAAC CAGTGCCTCG GCTTCAGCGT CAACCAGTGC GTCAGCTTCA 780

GCAAGTACCA TGTGCTTCAT GTCTCAG 807



(2) INFORMATION FOR SEQ ID NO: 373: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1068 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 373:

CATCGGCTTC AGCATCAACG AGTGCGTCCG CTTCAGCAAG TACTACCGCC TCAGCCTCAG 60

CGTCAACAAG TGCATCGGCT TCAGCGTCAA CGAGTGCGTC TGAGTCAGCA TCAACGAGTG 120 

CGTCACCTCA GCAAGCACAT CAGCTTCTGA ATCTGCATCA ACCAGTGCGT CACCTCAGCA 180 

TCGACAAGCG CCTCAGCTTC AGCAAGTACC AGTGCGTCAC CTCAGCGTCG ACAAGTGCGT 240 

CGGCTTCAGC AAGTACCAGT GCGTCAsCTC AGCAAGTACC AGTGCGTCAC CTCAGCGTCG 300 

ACAAGTGCGT CGGCCTCAAC CAGTGCATCT GAATCGGCAT CAACCAGTGC GTCACCTCAG 360 

CAAGTACTAG TGCATCAGCT TCAGCATCAA CGAGTGCATC GGCTTCAGCA TCAACCAGTG 420 

CATCAGAGTC AGCAAGTACC AGTGCGTCAG cTTCCGCATC AACAAGTGCC TCGGCTTCAG 480 

CAAGTACTAG CGCCTCAGCC TCAGCGTCAA CAAGTGCTTC AGCTTCCGCG TCAACCAGCG 540 

CCTCGGCCTC AGCAAGTATC TCAGCGTCTG AATCGGCATC AACAAGTGCC TCGGCTTCAG 600 

CATCAACGAG TGCATCAGTC TCAGCAAGCA CCAGTGCGTC GGCCTCAGCA AGCACCAGCG 660 

CGTCTGAATC CGCATCAACC AGTGCCTCAG CTTCAGCAAG TACCTCAGCA TCTGAATCAG 720 

CATCAACAAG TGCATCGGCT TCAGCAAGCA CAAGTGCTTC AGCCTCAGCA AGTATCTCAG 780 

CGTCTGAATC GGCATCAACG AGTGCGTCCG CTTCAGCAAG TACTAGCGCC TCAGCATCAG 840 

CGTCAACAAG TGCTTCGGCT TCAGCGTCAA CGAGTGCGTC TGAGTCAGCA TCAACGAGTA 900 

CGTCAGCCTC AGCAAGCACA TCAGCTTCTG AATCTGCATC AACCAGTGCG TCAGCCTCAG 960 

CATCGACAAG CGCCTCAGCT TCAGCAAGTA CCAGTGCGTC AGCCTCAGCA AGTACCAGTG 1020 

CTTCAGCCTC AGCGTCGACA AGTGCGTCGG GCTCAACCAG TGCATCTG 1068 



(2) INFORMATION FOR SEQ ID NO: 374: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 620 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 374: 

CAGCATCAAC GAGTGCTTCA GTTTCAGCGT CAACCAGTGC CTCTGAATCA GCTTCAACAA 60 

GTGCCTCGGC TTCAGCAAGC CCCAGTGCGT CGGCTTCAGC AAGTACTAGT GCATCGGCTT 120 

CAGCATCGAC AAGTGCGTCT GAATCGGCAT CAACGAGTGC TTCGGCTTCA GCATCAACGA 180 

GTGCGTCAGC CTCAGCAAGC ACATCAGCTT CTGAATCTGC ATcAACCAGT GCGTCCGyTT 240 
CAGCGTCAAC CAGTGCGTCG GCTTCAGCGT CGACAAGTGC TTCGGCTTCA GCATCAACGA 300

GTGCGTCGGC CTCAGCAAGC GCAAGTACCT CAGCGTCAGC TTCCGCCTCA ACCAGTGCGT 360 

CGGCTTCAGC AAGCACAAGT GCGTCAGCCT CAGCAAGTAT CTCAGCGTCT GAATCGGCAT 420 

CAACGAGTGC GTCTGAGTCA GCATCAACGA GTACGTCAGC CTCAGCAAGC ACATCAGCTT 480 

CTGAATCTGC ATCAACCAGT GCGTCAGCCT CAGCATCGAC AAGCGCCTCA GCTTCAGCAA 540 

GTACCAGTGC TTCAGCCTCA GCGTCGACAA GTGCGTCGGC CTCAACCAGT GCATCTGAAT 600 

CGGCATCAAC CAGTGCGTCA 620 



(2) INFORMATION FOR SEQ ID NO: 375: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 720 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 375: 



GTATTGGGGC GCCCCAACCT CTATGTGACT ACGGATTATT TCCTAGATTA CATGgGGATA 60

AACCATTTAG AAGAATTACC AGTGATTGAT GAGCTTGAGA TTCAAGCCCA AGAAAGCCAA 120

TTATTTGGTG AAAGGATAGA AGAAGATGAG AATCAATAAG TATATTGCCC ACGCAGGTGT 180

GGCCAGTAGG AGAAAAGCAG AAGAGCTGAT TAAGCAAGGC TTGGTGACGG TTAACGGCCA 240

AGTGGTGCGT GAACTAGCAA CCACTATCAA GTCAGGCGAC AAGGTCGAAG TTGAAGGTCA 300

ACCTATCTAC AACGAAGAAA AGGTCTACTA TCTGCTTAAC AAACCACGCG GTGTGATTTC 360

CAGTGTGACA GATGATAAGG GTCGCAAGAC GGTTGTCGAC CTCTTGCCCA ATGTCAAAGA 420

GCGTATTTAC CCTGTGGGTC GTTTGGACTG GGATACATCA GGTGTCTTGA TTTTGACCAA 480

TGATGGGGAC TTTACAGACG AGATGATTCA CCCTCGTAAT GAGATTGACA AGGTTTATGT 540

CGCGCGTGTT AAAGGTGTGG CCAATAAGGA CAATCTCCGC CCCTTGACCC GTGGTCTTGA 600

GATTGATGGT AAGAAAACCA AGCCATAATA TATAGGTTTT GTAGCCTCTA CACCATAAAT 660

ATTTGCTAAT AAAAATACTG TATTATTACC CTCTTAAGGT GCGAAATTAT TCAAGTTCTT 720



(2) INFORMATION FOR SEQ ID NO: 376:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 648 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 376:

CGCCATTTCC CATCGTACCG CCGAAAATCC CAGCGCCTCA GCCATCAAAT ATCCTATCAA 60 

CGTTCTCAAA AAAAGTGACC GCTCTCTCAT CATGTTTCCA AGTGGTAGCC GCCACTCAAA 120 

CGATGTCAAG GGGGGCGCAC ACTskATTGC CAAAATGGCC AAGGTCCGTA TCATGCCGGT 180 

TACCTACACC GGTCCCATGA CTTTGAAGGG CTTGATTAGC CGTGAACGTG TCGATATGAA 240 

CTTTGGAAAT CCAATCGATA TCTCAGATAT CAAGAAAATG AATGATGAAG GCATTGAAAC 300 

AGTCGCCAAT CGTATTCAAA CAGAATTCCA ACGTCTGGAC GAAGAAACGA AACAATGGCA 360

CAATGATAAA AAACCAAATC CACTCTGGTG GTTTATCCGC ATCCCTGCCC TCATCCTTGC 420 

TATTATCCTC GCTATCCTAA CCATCATCTT TAGCTTTATC GCAAGCTTCA TCTGGAACCC 480

AGATAAGAAA AGAGAAGAAC TTGCATAGAA GAAATGAACC TTGGCCAAAC AGCTAAGGTT 540 

TTCATTTATA TAGTAGATTG GwACTAGAAT AGTACACCTC TACTTCTAAA ACATTTTTAG 600 

AAATCGATTT GACTGTCCTG ATCGATTTGT CCTAATCTTA TTTCAATT 648 
(2) INFORMATION FOR SEQ ID NO: 377: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 690 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 377: 

GTGCATCGCT TTCAGCATCG ACAAGTGCGT CTGAATCGGC ATCAACGAGT GCTTCGGCTT 60 

CAGCATCAAC GAGTGCGTCA GCTTCAGCAA GCACATCAGC TTCTGAATCT GCATCAACCA 120 

GTGCGTCCGC TTCAGCGTCA ACCAGTGCGT CGGCTTCAGC GTCGACAAGT GCTTCGGCTT 180 

CAGCATCAAC GAGTGCGTCG GCCTCAGCAA GCGCAAGTAC CTCAGCGTCA GCTTCCGCCT 240 

CAACCAGTGC GTCCGCTTCA GCAAGCACAA GTGCGTCAGC CTCAGCAAGT ATCTCAGCGT 300 

CTGAATCGGC ATCAACGAGT GCGTCGGCCT CAGCAAGCGC AAGTACCTCA GCGTCAGCTT 360 

CCGCCTCAAC CAGTGCGTCG GCTTCAGCAA GCACAAGTGC GTCAGCCTCA GCAAGTATCT 420 

CAGCGTCTGA ATCGGCATCA ACGAGTGCGT CTGAGTCAGC ATCAACGAGT ACGTCAGCCT 480 

CAGCAAGCAC ATCAGCTTCT GAATCGGCAT CAACCAGTGC GTCAGCCTCA GCATCGACAA 540 

GCGCCTCAGC TTCAGCAAGT ACCAGTGCTT CAGCCTCAGC GTCGACAAGT GCGTCGGCCT 600 

CAACCAGTGC ATCTGAATCG GCATCAACCA GTGCGTCAGC CTCAGCAAGT ACTAGTGCAT 660 
CAGCTTCAGC ATCAACGAGT GCATCGGCTT 690
(2) INFORMATION FOR SEQ ID NO: 378: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1003 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 378: 

CGAGATTCTC TGGAGTTATG GATGTCGTTC CAATATGTGC ACGTTGGAAT GTTAGTGCTT 60 

ATATGGGGGG AACAGAATCC TCTCTTGATT GAAGACAAGC TAGTCATTAG GCTGGTTTGT 120 

CTTTTTGTCA ACTGTAGTGG GTTGATATAA TAGTATTAGT GAGTGGGATA AAAGTTTCAT 180 

TTAGTTTATT CAGTACAAAT TTAACGGGTC AAGATTTATA TACTAGTGGT GTTTTTGGGG 240 

CTGAGAGAAG TATCTTGATT TTATGTGTGG TTTTTATACT TACAGTTGTT CTGCTCCAAA 300 

GAGCTTGTAG AGAAGAATTA GCTCATAAAG GAGATTGATT ATTTTGATAT CAAAAAAATG 360 

CACAGGATAA CCTGATGCAT TTTTTTAGCG ACAATGCTTG CTACTTCCTT CTGTCGAATT 420 

TAGACAATTT TAAACCCCAA TTATTCACCC CAAATCTAAA AACCATCCAG AATCCTTGCC 480 

TTAGCTTAGA TCCTGGATGG TTTCTTTTTT CACCCAATGG GTGTTTTTTA CTAGACAAAA 540 

AAGAGTTTCC CCTTTATGGT ATAAGTGTAG AAAAAAACAC AAAAAGAAAG GAAACTCACA 600 

TGAACAGTTT ACCAAATCAT CACTTCCAAA ACAAGTCTTT TTACCAACTA TCTTTCGATG 660

GAGGTCATTT AACCCAGTAT GGTGGTCTTA TCTTTTTTCA GGAACTTTTT TCCCAGTTGA 720 

AACTAAAAGA GCGGATTTCT AAGTATTTAG TAACGAATGA CCAACGCCGC TACTGTCGTT 780 

ATTCGGATTC AGATATCCTT GTCCAGTTCC TCTTTCAACT GTTAACAGGT TATGGAACGG 840 

ACTATGCTTG TAAAGAATTG TCAGCTGATG CCTACTTTCC AAAATTATTG GAAGGAGGGC 900 

AGCTTGCTTC ACAGCCAACC TTATCCCGTT TTCTTTCCAG AACTGACGAG GAAACAGTCC 960 

ATAGTTTGCG ATGCCTCAAC CTTGAATTGG TCGAATTCTT TTT 1003 
(2) INFORMATION FOR SEQ ID NO: 379: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 738 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 379: 

CCGATGATTC TGATTGGTTT GCTCTTTACT TTGCTGGGAA TTTTGAGGTA GATCTATGAT 60

TGAAATACTA ATTGTTTTAG CTATTATCCT ATCTCTTGCT TTGATTGTAT TGGTAACTAT 120 

ACAACCCCGT CAAAATCAAC TATTTTCCAT GGATGCCACT AGTAATATTG GTAAACCAAG 180 

CTACTGGCAG AGCAACACCT TGGTCAAGGT GCTCACTTTA TTGGTGAGTT TGGCTTTATT 240 

TATTCTACTA TTAACCTTTA TGGTGATTAC TTATAAATAA AAGAAAACTT CAGATATTCA 300 

CCTTTTGTGG ATTGGTCTGA AGTTTTCTTT TTTATACTCA ATGAAAATCA AAGAGCAAAC 360 

TAGGAAGCTA GCCGCAckGC TCAAAACACC GTTTTGAGGT TGTAGATATA ACTGACGAGc 420 

GACTCAAAAC ACCGTTTTGA GGTTGTAGAT ATAACTGACG AGcGACTCAA AACACCGTTT 480 

TGAGGTTGTG GATAGAACTG ACGAGcGACT CAAAACACCG TTTTGAGGTT GTGGATAGAA 540 

CTGACGAAGT CGcTCAAAAC ACCGTTTTGA GGTTGTGGAT AGAACTGACG AAtgctCAAA 600 

ACACCGTTTT GAGGTTGTGG ATAGAACTGA CGAAGCgaaC ATATATACAG CAAGGCGACG 660 

CTGACGTGGT TTGAAGAGTA TTACTGTCTA TATTTTTGGT AAAAATCAAC TTTTACTTGG 720 

ATGAAGGTTT TTTTTTTT 738 



(2) INFORMATION FOR SEQ ID NO: 380: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 695 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 380:

CCGTCTTATC AAAGAGGTTA ACAAAGGCAC CAAATTTCTC GATACGAACG ACTTTAGCAC 60 

GGTAAACTTC ATCCACTTTG GCTTCACGAA CCAAACCAGC AATAATTTCT TTGGCACGGT 120 

TAATAGCATC TTGGTCACTA GAGTAGATAG ACACATTTCC TTCTTCGTCT ATATCAATCT 180 

TAACACCTGT TTCAGCGATA ATCTTGTCGA TGGTTTCTCC ACCCTTACCG ATGACAATCT 240 

TAATCTTGTC CACATCAATC TTGATCGTAT CAATTTTCGG AGCAGTTGGA GCCAATTCTG 300 

GACGAACTTC TGGAATGGTT GCTTCAATGA CATCAAGGAT TTCAAAACGC GCTTTCTTGG 360 

CTTGAGCAAG AGCCTCCGTC AAGATTTCTG CAGTAATCCC TTGAATCTTG ATATCCATTT 420 

GAAGGGCTGT AATCCCATCA CGAGTACCTG CAACCTTGAA GTCCATATCT CCAAAGTGAT 480 

CTTCCAAACC TTGGATATCT GTCAATACTG TGTAGTTATT TCCATCTGAG ATAAGCCCCA 540 

TAGCAATACC AGCTACTGGC GCCTTGATTG GCACACCACC AGCCATAAGG GCAAGAGTTC 600

CCGCACAGAT AGAAGCTTGA GATGAAGAAC CGTTTGATTC CAAAACTTCT GCTACTAGAC 660

GGATAGCGTA GGGGAATTCT TCCAAGCTTG GCAGG 695 
(2) INFORMATION FOR SEQ ID NO: 381: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 691 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 381: 

GACATCTTAT CTAAATACAT GCTAATATAT TTAGATACAA ACATTCCAAC TTGATAATTT 60 

TCACTCATCT TTCATCATTC CTTATACAAC TATGCAGTAT AAATAGAATA GTTTTCTCAT 120 

CAGAATGAGA CTATTTTAAT ATTAGATCCC CAATTATTCA CCCCAAATCT AAAAACCATC 180 

CAGAATCCTT GCCTTAGCTT AGATCCTGGA TGGTTTCTTT TTTCACCCAA TGGGTGTTTT 240 

TTACTAGACA AAAAAGAGTT TCCCCTTTAT GGTATAAGTG TAGAAAAAAA CACAAAAAGA 300 

AAGGAAACTC ACATGAACAG TTTACCAAAT CATCACTTCC AAAACAAGTC TTTTTACCAA 360 

CTATCTTTCG ATGGAGGTCA TTTAACCCAG TATGGTGGTC TTATCTTTTT TCAGGAACTT 420 

TTTTCCCAGT TGAAACTAAA AGAGCGGATT TCTAAGTATT TAGTAACGAA TGACCAACGC 480 

CGCTACTGTC GTTATTCGGA TTCAGATATC CTTGTCCAGT TCCTCTTTCA ACTGTTAACA 540 

GGTTATGGAA CGGACTATGC TTGTAAAGAA TTGTCAGCTG ATGCCTACTT TCCAAAATTG 600 

TTGGAAGGAG GGCAGCTTGc TTCACAGCCA ACCTTATCCC GwTTTCTTTC CAGAACTGAC 660 

GAGGAAACAG TCCATAGTTT GCGATGCCTC A 691 



(2) INFORMATION FOR SEQ ID NO: 382: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 750 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 382: 

ATCTCTCTGC GTAATGGTCC TCAGATAACT CTGATGATGT GTGGCGATAT AGAACTGAGC 60 

CAAGTTATGC CTAAAGGGCC TTAGGAATAG GAGCTTTCAC AAGCTTATCC AGATGATTAT 120 

CTTTTACTCG TTATGGACAA TGCTATATGG CATAAATCAA GTACCTTAAA GATTCCGACT 180 

AATATTGGCT TTGCATTTAT TCCTCCATAC ACACCAGAGA TGAACCCCAT TGAACAAGTG 240 
TGGAAAGAGA TTCGTAAACG TGGATTTAAG AATAAAGCCT TTCGAACTTT GGAAGATGTC 300

ATACAAGGAC TGGAGAAGGA GGTGATAAAG TCCATCGTTA ATCGGAGACG GACTAGAATG 360 

CTTTTTGAAA ACAGATGAGT ATAAAAAGAA AGTCCTCATT TCAATAGAAA TCACGACTTT 420 

CTGATGAATT TATAGTAAAA TGAAATAAGA ACAGGATAGT CAAATCGATT TCTAACAATG 480 

TTTTAGAAGC AGAGGTGTAC TATTCTAGTT TAAATCCACT ATATTTGGGG AGTGATAGAA 540 

AAGCCCTTCA TCAGCCAATC TACTTGTTCA GGTGCGAGAG CTTTGACATC CTTTTCTGTA 600 

CTGGACCAAG TCAGTTTTCC GTTCTCAAAG CGTTTATATA ATATCCAAAA TCCTTGACCA 660 

TCCCAGTAAA GAACTTTAAA GCGGTCTTTA CGTCCACCAC AAAAGAGAAA GACTTGATCG 720 

GAGAAAGGAT CCAATTCAAA GTGGGTTTGG 750 



(2) INFORMATION FOR SEQ ID NO: 383: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 738 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 383: 



TCAAATTCTT CGTGGTCCGC ATATCTnTCT TCGTACACGG CAGTCACTTG GTCTTTCACT 60

ACTCGAGTCG CAGCTTCACG GGCCAATTTC TCTTCTACTT GAACTGCCTT TTGGAGGTCA 120

CTGTTGTAGG CTGCAATGAT TTCAGCTTGC AATTCAGCAT CCACGTGAAG CAATTCCACT 180

TCTGCTTTTT CTTTACCGAC AGCAGCAACG ATTTCTTCTT GGAAGGCAAT CAATTCTTTG 240

ACAGCTTCGT GCCCTTTAAG GAGCGCTTCC AACATGATTT CTTCTGACAA TTCTTTGGCA 300

CCAGACTCTA CCATGTTGAT AGCGTGCTTG GTTCCAGCTA CTGTCAATTC AAGAAGAGAT 360

TGCTCTGCTT GTTCTTGACT TGGGTTGATG ATGATTTGGC CATCTACATA TCCCACTTGT 420

ACCCCAGCAA TTGGTCCGTC AAATGGAATA TCTGAAATAG ACAGTGCCAA AGATGAACCA 480

AACATAGCAG CCATTGGTGC AGATGCATTT TCATCATAAG AAAGCACTGT ATTGATGACT 540

TGGACTTCAT TACGGAAACC TTCCGCAAAC ATAGGACGAA TCGGACGGTC AATCAAACGC 600

GCTGTCAAGG TCGCATCTGT TGAAGGACGT CCTTCACGTT TCATAAAGCC ACCAGGAAAC 660

TTCCCAGCCG CATACATTTT TTCTTCGTAG TTGACTTGGA GTGGGAAGAA ATCCTCAGTT 720

GCCATTTTCT GGGGATCC 738



(2) INFORMATION FOR SEQ ID NO: 384: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 657 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 384: 



CCCCCTATTT ACCGTGGACT AAAGTTGTAC AAGAAAAGTG CAAATAAGAA ATCTCCAGAT 60

TAGGAACTAT ATATGAGTTC TCTAGTCTGG AGATTTTTCA ATAGACTTCG TTATTGGGCG 120

GTTACTTTCG AAACTTTGAA AACTTCAAAA AACGGATTTT TATCGCTTTC AAATTCTTTT 180

GGGGTCAAAC TCAGTAACTT ATTCGCCTTG TAGACTTCAT GACGCTCAGG GTATACTTTC 240

AAGGTCCCAA ATAGCCAAGA ATCGTCAGCG ATATTATCTG AATCATCTCC TTCTTGTTCT 300

CCTTTAGTTC GCCTGAGGAC AGCCTTGACA CGCGCCAGAA TTCTCTAGGG CTAAAAGGCT 360

TGGTCAGGTA GTCATCAGCC CCTAATTCCA AGGCCAAAAC CTTATCAAAT TCATCACTTT 420

TCGCAGAAAC CATCATAATT GGAGTTTTGA CGCCTTTGGC TCTCAGCCGC TTACAAACTT 480

CCATGCCATC TAATTGTGGT AACATGATAT CAAGCAAGAT AAAATCAAAG GGTTCTGTTT 540

CTGCCAAAGC TAAGGCCTTC CGTCCATTTG TCACCAATTG AGTAGAAAAG CCTTCCTTAC 600

TTAAATGGTA GTCAAGCAAT TTCAGAATGT GTTCTTCATC ATCCACTAAT AAGACTT 657



(2) INFORMATION FOR SEQ ID NO: 385: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 586 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 385: 

CCGCATCAGC ATCAACGAGT GCATCGGCTT CACGTCAACC AGTGCATCAG TCTCAGCAAG 60 

CACCAGTGCG TCGGCTTCAG CATCAACGAG TGCCTCAGCC TCAGCAAGTA TCTCAGCGTC 120 

TGAATCGGCA TCAACGAGTG CGTCAGCTCA GCAAGTACTA GTGCATCGGC TTCAGCAAGC 180 

ACCAGTGCGT CGGCTTCAGC ATCAACCAGT GCCTCAGCCT CAGCAAGTAT CTCAGCGTCT 240 

GAATCGGCAT CAACGAGTGC GTCACCTCAG CAAGTACTAG TGCATCAGCA TCAGCATCAA 300 

CGAGTGCATC GGCTTCAGCA AGTACCAGCG CCTCAGCTTC AGCAAGCACC AGTGCGTCAC 360 

CTCAGCAAGT ACCAGCGCCT CAGCCTCAGC AAGCACCAGT GCCTCAGCTT CAGCAAGTAC 420 

CAGTGCGTCA CCTCAGCATC GACAAGTGCG TCGGCTTCAG CAAGTACCTC AGCGTCTGAA 480 
TCAGCATCAA CGAGTGCGTC AGCTTCAGCA TCAACCAGTG CCTCAGCCTC AGCAAGTATC 540

AGTGCGTCAG CTTCAGCATC AACGAGTGCG TCAGCTGCAG CAAGTA 586 
(2) INFORMATION FOR SEQ ID NO: 386: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 451 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 386: 

CGTCGGCTTC AGCATCAACG AGTGCATCAG CTTCAGCATC AACAAGTGCT TCAGCTTCAG 60 

CAAGTACCAG TGCGTCGGCT TCAGCATCAA CGAGTGCTTC AGTCTCAGCG TCAACCAGTG 120 

CCTCTGAATC CGCATCAACA AGTGCCTCGG CTTCAGCAAG CACCAGTGCT TCGGCTTCAG 180 

CGTCAACGAG TGCGTCTGAG TCAGCATCAA CGAGTGCGTC ACCTCAGCAA GCACATCAGC 240 

TTCTGAATCT GCATCAACCA GTGCGTCAGC TTCCGCATCA ACAAGCGCCT CGGCCTCAGC 300 

AAGTACAAGT GCTTCAGCCT CAGCATCAAC CAGTGCATCA GCTTCAGCCT CAACAAGTGC 360 

TTCAGCCTCA GCGTCAACCA GTGCCTCGGC TTCAGCAAGT ACCAGTGCGT CAGTTcAGCA 420 

AGCACAAGTG CGTCAATTTA GCATCAACCA G 451 



(2) INFORMATION FOR SEQ ID NO: 387: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 425 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 387: 

TCTCAGCAAG CACCATTGCG TCGGCTTCAT CAAGCACCAG CGCGTTTGAA TCCGCATCAA 60

CCAGTGCTTC AGCTTCAGCC AAGTTACCTC AGCATCTGAA TCAGCATCAA CAAGTGCATC 120 

GGCTTCAGCA AGCACAAGTG CTTCAGCtCA GCAAGTATCT CAGCGTCTGA ATCGGCATCA 180 

ACGAGTGCGT CCGCTTCAGC AAGTACTAGC GCCTCAGCAT CAGCGTCAAC AAGTGCTTCG 240 

GCTTCAGCGT CAACGAGTGC GTCTGAGTCA GCATCAACGA GTACGTCAGC CTCAGCAAGC 300 

ACATCAGCTT CTGAATCTGC ATCAACCAGT GCGTCAGCCT CAGCATCGAC AAGCGCCTCA 360 

GCTTCAGCAA GTACCAGTGC GTCAGCCTCA GCAAGTACCA GTGCTTCAGC CTCAGCGTCG 420 
ACAAG 425
(2) INFORMATION FOR SEQ ID NO: 388: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 572 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 388: 

AGAGGATCCC CGGATCCTCA GTCGCTGAGA TAACTCCTTT GGGCTTGTTC ATCATGTAGT 60 

AGACAAACTC TTCATACTCC AACACTTGCC CATTTTATGC GAATCTCATC TATTTTTTCT 120 

TTTTTTTGCA ATTTAGCTGA TTTTTCTTTT TTACCATTTA CAGTCACGCG CCCAGCCTTG 180 

AGCAAGTTTT TGACCTCAGT CCGACTTCCC ACCGCACAGG CAACTAAAAA TTTATCTAAT 240 

CTCATAGAAC TATTATATCA TATCAAAAGG AGGCTAGTAC AATGACCAAC CTCCTTTTCG 300 

TTTCATACTC TTCAAAAATC TCTTCAAACC GCGTCAACGT CGCCTTGCCG TATATATGTT 360 

ACTGACTTCG TCAGTTCTAT CTGCAACCTC AAAACAGTGT TTTGAGCTGA CTTCGTCAGT 420 

TCTATCTGCA ACCTCAAAGC AGTGCTTTGA GCATCCTGCG GCTAGTTTCC kAGTkTGCTC 480 

TTTGATTTwC ATTGAGTATC AGATTTAGGA AATTAACTTC CTCGkCTCCA AAAAAkAGCT 540 

AAAACAATCA AGGCTCCTAA AATCGCTGGG AT 572 



(2) INFORMATION FOR SEQ ID NO: 389: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 505 base pairs 

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 389: 

CAACAAGTGC CTCGGCTTCA GCATGCACAA GTGCTTCAGC TTCAGCATGT ACCTGAGCGT 60 

CTGAATCAGC ATCAACGTGT GCGTCCGCTT CAGCATGTAC TGCTGCCTCA GCATCAGCGT 120 

CAAcAwGTGC TTCGGCTTCA GCGTCAACGA GTGCGTCTGA GTCAGCATCA ACGAGTACGT 180 

CAGCCTCAGC AAGCACATCA GCTTCTGAAT CTGCATCAAC CAGTGCGTCA GCCTCAGCAT 240 

CGACAAGCGC CTCAGCTTCA GCAAGTACCA GTGCGTCAGC CTCAGCAAGT ACCAGTGCTT 300 

CAGCCTCAGC GTCGACAAGT GCGTCGGCCT CAACCAGTGC ATCTGAATCG GCATCAACCA 360 

GTGCGTCAGC CTCAGCAAGT ACTAGCGCCT CAGCCTCAGC ATCAACGAGT GCGTCCGCTT 420 
CAGCAAGTAC TAGTGCATCA GCATCAGCAT CAACGAGTGC ATCGGCTTCA GCAAGTACCA 480

GCGCCTCAGC TTCAGCAAGC ACCGG 505 
(2) INFORMATION FOR SEQ ID NO: 390: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 447 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 390: 

GCTAAGACTA CCTCATTAGG GGCATAGGCT GCTAAAATAA CTGCAGCTGT GGTTAATGAC 60 

AATACTGTAC TTTTTTTCAT TTTAATTCCT TACATATTTA TATAACTTCC AATAGATAAT 120 

AAACTTTAAC TTTGCTAGCC TTTGTTATAA AAAGTTTTAC TAAGTATTAT CTAGGAAATA 180 

GAGTAGTACA TTTATATATA ATTGTTATCT CTCTATAAAA ACAGTATATC ATTTAAAAAA 240 

ATTTAAGTCA AAAAAATTAA CATTAGTTAA TTTATTTTTT AGCACACATT AAAAAATAAG 300 

ATTAGTACTC AATGAAAATC AAAGAGCAAA CTAGGAAACT AGCCGCAGAT TGCTCAAAAC 360 

AGTGTTTTGA GGTTGTAGAT GGAATGACGT AGTCAGCTCA AAACACTGTT TTGAAGTTGT 420 

GGATAGAACT GACGAAGTCG GTACCGA 447 



(2) INFORMATION FOR SEQ ID NO: 391: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 572 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 391:

AGCACTTGTC GTTGAATTCT ACAACAAAAT GTTGTAATAT TTTATTGAAT AAGATAGGCC 60 

TTGATATTAA GCACTTTGGG ACGTTCTCCC TTAGTGCTTT TTTGATTTCT CTTAGTATCC 120 

AGCTATAATC GTTGAGACAT AACTAGACCG ATATAGTCCA AAGTGATATA GTAAAATGAA 180 

CCAAAAATAG TACACAATGT GGTATAATCC TTTTATGGCA TATTCAATAG ATTTTCGTAA 240 

AAAAGTTCTC TCTTATTGTG AGCGAACAGG TAGTATAACA GAAGCATCAC ACGTTTTCCA 300 

AATCTCACGT AATACCATTT ATGGCTGGTT AAAGCTAAAA GAGAAAACAG GAGAGCTAAA 360 

CCACCAAGTA TAGTGTATTG AATCTATAAC AGTACACCTT GGCTGCTAAA ATATTTCTAT 420 
AAATTAATTT GACTTTCCTG ATAGAGATGT TCACATCTTA TTTCAAACTA CTATATAAGT 480

TCTATAATCT CTTTATAAGA TTTGCCCATC AGACAAAATA GAACGATTTG AAGGCGTTTA 540 

TGATATTTAG CTGTACGAGA GTCTTTTAAA AG 572
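
The rows of the sequence listing above share a fixed layout: blocks of bases followed by a running count of the bases listed so far, with the final count equal to the declared LENGTH. As an illustrative sketch only, that layout can be checked mechanically. The function name check_listing and the regular expression below are assumptions introduced for this illustration and are not part of the application.

```python
import re

# Hypothetical helper: one listing row is "blocks of letters, then a running count".
# Letters are matched case-insensitively because the listing uses lower-case
# characters for some positions.
ROW = re.compile(r"^([A-Za-z ]+?)\s+(\d+)\s*$")


def check_listing(rows, declared_length):
    """Return a list of human-readable problems found in one sequence's rows."""
    problems = []
    total = 0
    for number, row in enumerate(rows, start=1):
        match = ROW.match(row.strip())
        if match is None:
            problems.append(f"row {number}: no trailing running count")
            continue
        bases = match.group(1).replace(" ", "")
        total += len(bases)
        stated = int(match.group(2))
        if total != stated:
            problems.append(f"row {number}: counted {total}, row says {stated}")
            total = stated  # resynchronise so later rows are judged on their own
    if total != declared_length:
        problems.append(f"final count {total} differs from LENGTH {declared_length}")
    return problems


if __name__ == "__main__":
    # The last two rows of SEQ ID NO: 391, checked in isolation (60 + 32 bases).
    sample = [
        "TTCTATAATCT CTTTATAAGA TTTGCCCATC AGACAAAATA GAACGATTTG AAGGCGTTTA 61",
        "TGATATTTAG CTGTACGAGA GTCTTTTAAA AG 93",
    ]
    print(check_listing(sample, 93))
```

A check of this kind only confirms that the counts are internally consistent; it cannot tell whether an individual base was misread.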

DENMARK

The applicant hereby requests that, until the application has been laid open to public inspection 
(by the Danish Patent Office), or has been finally decided upon by the Danish Patent Office 
without having been laid open to public inspection, the furnishing of a sample shall only be 
effected to an expert in the art. The request to this effect shall be filed by the applicant with the 
Danish Patent Office not later than at the time when the application is made available to the 
public under Sections 22 and 33(3) of the Danish Patents Act. If such a request has been filed by 
the applicant, any request made by a third party for the furnishing of a sample shall indicate the 
expert to be used. That expert may be any person entered on a list of recognized experts drawn 
up by the Danish Patent Office or any person approved by the applicant in the individual case. 

SWEDEN 

The applicant hereby requests that, until the application has been laid open to public inspection 
(by the Swedish Patent Office), or has been finally decided upon by the Swedish Patent Office 
without having been laid open to public inspection, the furnishing of a sample shall only be 
effected to an expert in the art. The request to this effect shall be filed by the applicant with the 
International Bureau before the expiration of 16 months from the priority date (preferably on the 
Form PCT/RO/134 reproduced in annex Z of Volume I of the PCT Applicant's Guide). If such a 
request has been filed by the applicant, any request 
made by a third party for the furnishing of a sample shall indicate the expert to be used. That 
expert may be any person entered on a list of recognized experts drawn up by the Swedish Patent 
Office or any person approved by the applicant in the individual case. 

UNITED KINGDOM 

The applicant hereby requests that the furnishing of a sample of a microorganism shall only be 
made available to an expert. The request to this effect must be filed by the applicant with the 
International Bureau before the completion of the technical preparations for the International 
publication of the application. 

NETHERLANDS 

The applicant hereby requests that until the date of a grant of a Netherlands patent or until the 
date on which the application is refused or withdrawn or lapses, the microorganism shall be made 
available as provided in Rule 31F(1) of the Patent Rules only by the issue of a sample to an 
expert. The request to this effect must be furnished by the applicant with the Netherlands 
Industrial Property Office before the date on which the application is made available to the 
public under Section 22C or Section 25 of the Patents Act of the Kingdom of the Netherlands, 
whichever of the two dates occurs earlier. 

SINGAPORE

The applicant hereby requests that the furnishing of a sample of a microorganism shall only be 
made available to an expert. The request to this effect must be filed by the applicant with the 
International Bureau before the completion of the technical preparations for international 
publication of the application. 

NORWAY 

The applicant hereby requests that, until the application has been laid open to public inspection 
(by the Norwegian Patent Office), or has been finally decided upon by the Norwegian Patent 
Office without having been laid open to public inspection, the furnishing of a sample shall only 
be effected to an expert in the art. The request to this effect shall be filed by the applicant with 
the Norwegian Patent Office not later than at the time when the application is made available to 
the public under Sections 22 and 33(3) of the Norwegian Patents Act. If such a request has been 
filed by the applicant, any request made by a third party for the furnishing of a sample shall 
indicate the expert to be used. That expert may be any person entered on a list of recognized 
experts drawn up by the Norwegian Patent Office or any person approved by the applicant in the 
individual case. 

AUSTRALIA 

The applicant hereby gives notice that the furnishing of a sample of a microorganism shall only 
be effected prior to the grant of a patent, or prior to the lapsing, refusal or withdrawal of the 
application, to a person who is a skilled addressee without an interest in the invention 
(Regulation 3.25(3) of the Australian Patents Regulations). 

FINLAND 

The applicant hereby requests that, until the application has been laid open to public inspection 
(by the National Board of Patents and Registration), or has been finally decided upon by the 
National Board of Patents and Registration without having been laid open to public inspection, 
the furnishing of a sample shall only be effected to an expert in the art. 

ICELAND 

The applicant hereby requests that, until the application has been laid open to public inspection 
(by the Icelandic Patent Office), or has been finally decided upon by the Icelandic Patent Office 
without having been laid open to public inspection, the furnishing of a sample shall only be 
effected to an expert in the art. 

What Is Claimed Is: 

1. Computer readable medium having recorded thereon the nucleotide 
sequence depicted in SEQ ID NOS: 1-391, a representative fragment thereof or a 
nucleotide sequence at least 95% identical to a nucleotide sequence depicted in SEQ 
ID NOS: 1-391. 

2. Computer readable medium having recorded thereon any one of the 
fragments of SEQ ID NOS: 1-391 depicted in Tables 2 and 3 or a degenerate variant 
thereof. 

3. The computer readable medium of claim 1, wherein said medium is 
selected from the group consisting of a floppy disc, a hard disc, random access 
memory (RAM), read only memory (ROM), and CD-ROM. 

4. The computer readable medium of claim 3, wherein said medium is 
selected from the group consisting of a floppy disc, a hard disc, random access 
memory (RAM), read only memory (ROM), and CD-ROM. 

5. A computer-based system for identifying fragments of the Streptococcus 
pneumoniae genome of commercial importance comprising the following elements: 

a) a data storage means comprising the nucleotide sequence of SEQ ID 
NOS: 1-391, a representative fragment thereof, or a nucleotide sequence at least 
95% identical to a nucleotide sequence of SEQ ID NOS: 1-391; 

b) search means for comparing a target sequence to the nucleotide sequence 
of the data storage means of step (a) to identify homologous sequence(s), and 

c) retrieval means for obtaining said homologous sequence(s) of step (b). 

6. A method for identifying commercially important nucleic acid fragments 
of the Streptococcus pneumoniae genome comprising the step of comparing a 
database comprising the nucleotide sequences depicted in SEQ ID NOS: 1-391, a 
representative fragment thereof, or a nucleotide sequence at least 95% identical to a 
nucleotide sequence of SEQ ID NOS: 1-391 with a target sequence to obtain a 
nucleic acid molecule comprised of a complementary nucleotide sequence to said 
target sequence, wherein said target sequence is not randomly selected. 

7. A method for identifying an expression modulating fragment of 
Streptococcus pneumoniae genome comprising the step of comparing a database 
comprising the nucleotide sequences depicted in SEQ ID NOS: 1-391, a 
representative fragment thereof, or a nucleotide sequence at least 95% identical to 
the nucleotide sequence of SEQ ID NOS: 1-391 with a target sequence to obtain a 
nucleic acid molecule comprised of a complementary nucleotide sequence to said 
target sequence, wherein said target sequence comprises sequences known to 
regulate gene expression. 

8. An isolated protein-encoding nucleic acid fragment of the Streptococcus 
pneumoniae genome, wherein said fragment consists of the nucleotide sequence of 
any one of the fragments of SEQ ID NOS: 1-391 depicted in Tables 2 and 3, or a 
degenerate variant thereof. 

9. A vector comprising any one of the fragments of the Streptococcus 
pneumoniae genome SEQ ID NOS: 1-391 depicted in Tables 2 and 3 or a 
degenerate variant thereof. 

10. An isolated fragment of the Streptococcus pneumoniae genome, 
wherein said fragment modulates the expression of an operably linked open reading 
frame, wherein said fragment consists of the nucleotide sequence from about 10 to 
200 bases in length which is 5' to any one of the open reading frames depicted in 
Tables 2 and 3 or a degenerate variant thereof. 
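
As an illustration of what "5' to an open reading frame" means in coordinate terms, the following minimal sketch extracts the bases immediately upstream of an open reading frame on either strand of a contig. It assumes 0-based, half-open coordinates given on the forward strand; the function names and the toy contigs are hypothetical and are not taken from Tables 2 and 3.

```python
_COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence."""
    return seq.translate(_COMPLEMENT)[::-1]


def upstream_region(contig: str, orf_start: int, orf_end: int,
                    strand: str, length: int) -> str:
    """Return up to `length` bases lying 5' of an open reading frame.

    Assumptions: orf_start and orf_end are 0-based, half-open coordinates
    on the forward strand; strand is "+" or "-". The result is truncated
    when the contig ends before `length` bases are available.
    """
    if strand == "+":
        begin = max(0, orf_start - length)
        return contig[begin:orf_start]
    if strand == "-":
        # On the reverse strand, the 5' side lies after orf_end in
        # forward-strand coordinates and must be reverse-complemented.
        end = min(len(contig), orf_end + length)
        return reverse_complement(contig[orf_end:end])
    raise ValueError("strand must be '+' or '-'")


# Toy forward-strand example: the ORF starts at the ATG at index 13.
forward_contig = "CCCCTATAATGGGATGAAATTTTAA"
forward_upstream = upstream_region(forward_contig, 13, 25, "+", 10)

# Toy reverse-strand example: the ORF occupies indices 0-6 (reads ATGTTTT
# on the reverse strand), so its 5' flank is the reverse complement of GGG.
reverse_contig = "AAAACATGGG"
reverse_upstream = upstream_region(reverse_contig, 0, 7, "-", 3)
```

The claimed range of about 10 to 200 bases would correspond to calling `upstream_region` with `length` between 10 and 200.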

11. A vector comprising any one of the fragments of the Streptococcus 
pneumoniae genome of claim 8. 

12. An organism which has been altered to contain any one of the 
fragments of the Streptococcus pneumoniae genome of claim 8. 

13. An organism which has been altered to contain any one of the 
fragments of the Streptococcus pneumoniae genome of claim 10. 

14. A method for regulating the expression of a nucleic acid molecule
comprising the step of covalently attaching to said nucleic acid molecule a nucleic 
acid molecule consisting of the nucleotide sequence from about 10 to 100 bases 5' 
to any one of the fragments of the Streptococcus pneumoniae genome depicted in 
SEQ ID NOS: 1-391 and Tables 2 and 3 or a degenerate variant thereof. 

15. An isolated nucleic acid molecule encoding a homolog of any of the 
fragments of the Streptococcus pneumoniae genome of SEQ ID NOS: 1-391 and
Tables 2 and 3, wherein said nucleic acid molecule is produced by a process 
comprising steps of: 

a) screening a genomic DNA library using as a probe a target sequence 
defined by any of SEQ ID NOS: 1-391 and Tables 2 and 3, including fragments 
thereof; 

b) identifying members of said library which contain sequences that 
hybridize to said target sequence; and 

c) isolating the nucleic acid molecules from said members identified in
step (b).

16. An isolated DNA molecule encoding a homolog of any one of the 
fragments of the Streptococcus pneumoniae genome of SEQ ID NOS: 1-391 and 
Tables 2 and 3, wherein said nucleic acid molecule is produced by a process
comprising steps of: 

a) isolating mRNA, DNA, or cDNA produced from an organism; 

b) amplifying nucleic acid molecules whose nucleotide sequence is 
homologous to amplification primers derived from said fragment of said 
Streptococcus pneumoniae genome to prime said amplification; and

c) isolating said amplified sequences produced in step (b). 

17. An isolated polypeptide encoded by any of the fragments of the 
Streptococcus pneumoniae genome of SEQ ID NOS: 1-391 and depicted in Tables 2
and 3 or by a degenerate variant of said fragments.

18. An isolated polynucleotide molecule encoding any one of the 
polypeptides of claim 17. 

19. An antibody which selectively binds to any one of the polypeptides of
claim 17. 

20. A method for producing a polypeptide in a host cell comprising the 
steps of: 

a) incubating a host containing a heterologous nucleic acid molecule whose
nucleotide sequence consists of any one of the fragments of the Streptococcus
pneumoniae genome of SEQ ID NOS: 1-391 and depicted in Tables 2 and 3, under
conditions where said heterologous nucleic acid molecule is expressed to produce
said protein, and

b) isolating said protein. 

Figure 2